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Description 

technical Field 


5 The invention relates to materials and methodologies for managing the spread of non-A, non-B hepatitis 
virus (NANBV) infection. More specifically, it relates to diagnostic DNA fragments, diagnostic proteins, 
diagnostic antibodies and protective antigens and antibodies for an etiologic agent of NANB hepatitis, i.e., 
hepatitis C virus. 
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Background Art 

Non-A, Non-B hepatitis (NANBH) is a transmissible disease or family of diseases that are believed to be 
viral-induced, and that are distinguishable from other forms of viral-associated liver diseases, including that 
55 caused by the known hepatitis viruses, i.e., hepatitis A virus (HAV), hepatitis B virus (HBV), and delta 
hepatitis virus (HDV), as well as the hepatitis induced by cytomegalovirus (CMV) or Epstein-Barr virus 
(EBV). NANBH was first identified in transfused individuals. Transmission from man to chimpanzee and 
serial passage in chimpanzees provided evidence that NANBH is due to a transmissible infectious agent or 
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agents. However, the transmissible agent responsible for NANBH is still unidentified and the number of 
agents which are causative of the disease are unknown. 

Epidemiologic evidence is suggestive that there may be three types of NANBH: the water-borne 
epidemic type; the blood or needle associated type; and the sporadically occurring (community acquired) 

5 type. However, the number of agents which may be the causative of NANBH are unknown. 

Clinical diagnosis and identification of NANBH has been accomplished primarily by exclusion of other 
viral markers. Among the methods used to detect putative NANBV antigens and antibodies are agar-gel 
diffusion, counterimmunoelectrophoresis, immunofluorescence microscopy, immune electron microscopy, 
radioimmunoassay, and enzyme-linked immunosorbent assay. However, none of these assays has proved 

70 to be sufficiently sensitive, specific, and reproducible to be used as a diagnostic test for NANBH. 

Until now there has been neither clarity nor agreement as to the identity or specificity of the antigen 
antibody systems associated with agents of NANBH. This is due, at least in part, to the prior or co-infection 
of HBV with NANBV in individuals, and to the known complexity of the soluble and particulate antigens 
associated with HBV, as well as to the integration of HBV DNA into the genome of liver cells. In addition, 

75 there is the possibility that NANBH is caused by more than one infectious agent, as well as the possibility; 
that NANBH has been misdiagnosed. Moreover, it is unclear what the serological assays detect in the 
serum of patients with NANBH. It has been postulated that the agar-gel diffusion and counterimmunoelec- 
trophoresis assays detect autoimmune responses or non-specific protein interactions that sometimes occur 
between serum specimens, and that they do not represent specific NANBV antigen-antibody reactions. The 

20 immunofluorescence, and enzyme-linked immunosorbent, and radioimmunoassays appear to detect low 
levels of a rheumatoid-factor-like material that is frequently present in the serum of patients with NANBH as 
well as in patients with other hepatic and nonhepatic diseases. Some of the reactivity detected may. 
represent antibody to host-determined cytoplasmic antigens. 

There are a number of candidate NANBV. See, for example the reviews by Prince (1983), Feinstone 

25 and Hoofnagle (1984), and Overby (1985, 1986, 1987) and the article by Iwarsbn (1987). However, there is 
no proof that any of these candidates represent the etiological agent of NANBH. 

The demand for sensitive, specific methods for screening and identifying carriers of NANBV and 
NANBV contaminated blood or blood products is significant Post-transfusion hepatitis (PTH) occurs in 
approximately 10% of transfused patients, and NANBH accounts for up to 90% of these cases. The major 

30 problem in this disease is the frequent progression to chronic liver damage (25-55%). 

Patient care as well as the prevention of transmission of NANBH by blood and blood products or by 
close personal contact require reliable diagnostic and prognostic tools to detect nucleic acids, antigens and 
antibodies related to NANBV. In addition, there is also a need for effective vaccines and immunotherapeutic 
therapeutic agents for the prevention and/or treatment of the disease. 

35 ... . 

Disclosure of the Invention 

The invention pertains to the isolation and characterization of a newly discovered etiologic agent of 
NANBH, hepatitis C virus (HCV). More specifically, the invention provides a family of cDNA replicas of 

40 portions of HCV genome. These cDNA replicas were isolated by a technique which included a novelstep of 
screening expression products from cDNA libraries created from a particulate agent in infected tissue with 
sera from patients with NANBH to detect newly synthesized antigens derived from the genome of the 
heretofore unisolated and uncharacterized viral agent, and of selecting clones which produced products 
which reacted immunologically only with sera from infected individuals as .compared to non-infected 

45 individuals. 

Studies of the nature of the genome of the HCV, utilizing probes derived from the HCV cDNA, as well 
as sequence information contained within the HCV cDNA, are suggestive that HCV is a Flavivirus or a Flavi- 
like virus. 

Portions of the cDNA sequences derived from HCV are useful as probes to diagnose the presence of 
so virus in samples, and to isolate naturally occurring variants of the virus. These cDNAs also make available 
polypeptide sequences of HCV antigens encoded within the HCV genome(s) and permits the production of 
polypeptides which are useful as standards or reagents in diagnostic tests and/or as components of 
vaccines. Antibodies, both polyclonal and monoclonal, directed against HCV epitopes contained within these 
polypeptide sequences are also useful for diagnostic tests, as therapeutic agents, for screening of antiviral 
55 agents, and for the isolation of the NANBV agent from which these cDN As derive. In addition, by utilizing 
probes derived from these cDNAs it is possible to isolate and sequence other portions of the HCV genome, 
thus giving rise to additional probes and polypeptides which are useful in the diagnosis and/or treatment, 
both prophylactic and therapeutic, of NANBH. 
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Thus, the invention provides a polypeptide in substantially isolated form comprising a contiguous: 
sequence of at least 10 amino acids encoded by the genome hepatitis C virus (HCV) and comprising an 
antigenic determinant, wherein HCV is characterized by: 
(i) a positive stranded RNA genome; 
5 (ii) said genome comprising an open reading frame (ORF) encoding a poly protein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 amino 
acid sequence in Figure 14. 

The invention also provides a polynucleotide in substantially isolated form comprising a contiguous 
sequence of nucleotides which is capable of selectively hybridizing to the genome of hepatitis C virus 
w (HCV) or the complement thereof, wherein HCV is characterized, by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 amino 
acid sequence in Figure 14. 

75 The invention also provides a DNA polynucleotide encoding a polypeptide, which polypeptide! com- 
prises a contiguous sequence of at least 10 amino acids encoded by the genome of hepatitis C virus; (HCV) 
and comprising an antigenic determinant, wherein HCV is characterized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

20 (iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 amino 
acid sequence in Figure 14. v 

The invention also provides a hepatitis C virus (HCV) immunoassay antigen fixed to a solid phase, 
wherein HCV is characterized by: 
(i) a positive standard RNA genome; 
25 (ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotejn comprising an amino acid sequence having at least 40% homology to the 859 amino 
acid sequence in Figure 14; 

said antigen comprising an antigenic determinant immunologically reactive with an anti-HCV antibody, 
wherein 

30 (a) said anti-HCV antibody is immunologically reactive with a reference antigenic determinant (i) encoded 
by an HCV cDNA insert in the Iambda-gt1 1 library deposited with the American Type Culture Collection 
(ATCC) under accession no. 40394, or (ii) found in Figure 47; and 

(b) said reference antigenic determinant is immunologically reactive with sera from HCV-infected : 
humans. 

35 The invention also provides an immunoassay for detecting antibody against hepatitis C virus (HCV) 
(anti-HCV antibody), which immunoassay comprises: 

(a) providing a polypeptide comprising an antigenic determinant bindable by said anti-HCV antibody, 
wherein said antigenic determinant comprises a contiguous amino acid sequence encoded by the HCV 
genome and wherein HCV is characterized by: 

40 (i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
■ amino acid sequence in Figure 14, 

(b) -incubating a biological sample with said, polypeptide under conditions: that allow for the formation of 
45 antibody-antigen complex; and 

(c) determining whether antibody-antigen complex comprising said polypeptide is formed. 

The invention further relates to: a purified HCV polynucleotide; a recombinant HCV polynucleotide; a 
recombinant polynucleotide comprising a sequence derived from an HCV genome or from HCV cDNA; a 
recombinant polynucleotide encoding an epitope of HCV; a recombinant vector containing any of the above 

so recombinant polynucleotides, and a host cell transformed with any of these vectors. 

The invention further relates to: a recombinant expression system comprising an open reading frame 
(ORF) of DNA derived from an HCV genome or from HCV cDNA, wherein the ORF is operably linked to a 
control sequence compatible with a desired host, a cell transformed with the recombinant expression 
system, and a polypeptide produced by the transformed cell. 

55 The invention can be utilized to obtain purified HCV particles, a preparation of polypeptides from the 
purified HCV; a purified HCV polypeptide; a purified polypeptide comprising an epitope which is im- 
munologically identifiable with an epitope contained in HCV. 
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The invention also relates to a recombinant HCV polypeptide; a recombinant polypeptide comprised of 
a sequence derived from an HCV genome or from HCV cDNA; a recombinant polypeptide comprised of an 
HCV epitope; and a fusion polypeptide Comprised of an HCV polypeptide. 

The invention also relates to an anti-HCV antibody composition comprising antibodies that bind said 

5 antigenic determinant of a polypeptide according to the invention which is (a) a purified preparation of 
polyclonal antibodies, or (b) a monoclonal antibody composition. 

The invention also relates to a particle which is immunogenic against HCV infection comprising a non- 
HCV polypeptide having an amino acid sequence capable of forming a particle when said sequence is 
produced in a eukaryotic host, and an HCV epitope. The invention also relates to a polynucleotide probe for 

w HCV, the probe comprising a polynucleotide of the invention which further comprises a detectable label. 
The invention also relates to a polymerase chain reaction (PC R) kit comprising a pair of primers capable of 
priming the synthesis of cDNA in a PCR reaction where each of the primers is a polynucleotide according 
to the invention. The invention also finds application in the production of kits such as those for assaying a 
sample for the presence or absence of HCV polynucleotides by (a) contacting the sample with a probe 

75 comprising a polynucleotide of the invention, for example one containing about 8 or more nucleotides, 
under conditions that allow the selective hybridization of said probe to an HCV polynucleotide or the 
compliment thereof in the sample; and (b) detecting any polynucleotide duplexes comprising said probe. 

• Other aspects to which the invention relates are: a polypeptide comprised of an HCV epitope, attached 
to a solid substrate; and an antibody to an HCV epitope, attached to a solid substrate. 

20 Still other aspects to which the invention relates are: a method for producing a polypeptide containing 
an HCV epitope comprising incubating host cells transformed with an expression vector containing a 
sequence encoding a polypeptide containing an HCv epitope under conditions which allow expression of 
said polypeptide; and a polypeptide containing an HCV epitope produced by this method. 

The invention also relates to a method for detecting HCV nucleic acids in a sample comprising reacting 

25 nucleic acids of the sample with a probe for an HCV polynucleotide under conditions which allow the 
formation of a polynucleotide duplex between the probe and the HCV nucleic acid from the sample; and 
detecting a polynucleotide duplex which contains the probe. 

Immunoassays are also included in the invention. These include an immunoassay for detecting an HCV 
antigen comprising (a) providing an antibody composition according to the invention; (b) incubating a 

30 sample with the antibody composition under conditions that allow for the formation of an antibody-antigen 
complex; and (c) detecting antibody-antigen complexes comprising the anti-HCV antibodies. The invention 
also provides an immunoassay for detecting antibodies directed against an HCV antigen comprising (a) 
providing a polypeptide comprising an antigenic determinant bindable by said anti-HCV antibody, wherein 
said antigenic determinant comprises a contiguous amino acid sequence encoded by said genome; (b) 

35 incubating a biological sample with said polypeptide under conditions that allow for the formation of an 
antibody-antigen complex; and (c) detecting antibody-antigen complexes comprising said polypeptide. 

The invention also provides vaccine compositions for treatment of HCV infection comprising an 
immunogenic peptide Containing an HCV epitope, or an inactivated preparation of HCV, or an attenuated 
preparation of HCV. 

40 An application of the invention is a tissue culture grown cell infected with HCV and the invention 
includes a method of growing HCV by providing cells, e.g. hepatocytes or macrophages, infected with HCV 
and propagating such cells in vitro. 

Yet another application of the invention is its use in a method for producing antibodies to HCV 
comprising administering to an individual an isolated immunogenic polypeptide containing an HCV epitope 

45 in an amount sufficient to produce an immune response. 

Still another application of the invention is a method for isolating cDNA derived from the genome of an 
unidentified infectious agent, comprising: (a) providing host cells transformed with expression vectors 
containing a cDNA library prepared from nucleic acids isolated from tissue infected with the agent and 
growing said host cells under conditions which allow expression of polypeptide(s) encoded in the cDNA; (b) 

so interacting the expression products of the cDNA with an antibody containing body component of an 
individual infected with said infectious agent under conditions which allow an immunoreaction, and detecting 
antibody-antigen complexes formed as a result of the interacting; (c) growing host cells which express 
polypeptides that form antibody-antigen complexes in step (b) under conditions which allow their growth as 
individual clones and isolating said clones; (d) growing cells from the clones of (c) under conditions which 

55 allow expression of polypeptide(s) encoded within the cDNA, and interacting the expression products with 
antibody containing body components of individuals other than the individual in step (a) who are infected 
with the infectious agent and with control individuals uninfected with the agent, and detecting antibody- 
antigen complexes formed as a result of the interacting; (e) growing host cells which express polypeptides 
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that form antibody-antigen complexes with antibody containing body components of infected individuals and 
individuals suspected of being infected, and not with said components of control individuals, under 
conditions which allow their growth as individual clones and isolating said clones; and (f) isolating the cDNA 
from the host cell clones of (e). 

s .... - •.. :' ■ • ' . 

Brief Description of the Drawings 

Fig. 1 shows the double-stranded nucleotide sequence of the HCV cDNA insert in clone 5-1-1, and the 
putative amino acid sequence of the polypeptide encoded therein. 
m Fig. 2 shows the homologies of the overlapping HCV cDNA sequences in clones 5-1 -1 , 81 , 1-2, and 91 . 

Fig. 3 shows a composite sequence of HCV cDNA derived from overlapping clones 81, 1-2, and 91 , and 
the amino acid sequence encoded therein. 

Fig. 4 shows the double-stranded nucleotide sequence of the HCV cDNA insert in clone 81, and the 
putative amino acid sequence of the polypeptide encoded therein. 
75 Fig. 5 shows the HCV cDNA sequence in clone 36, the segment which overlaps the NANBV cDNA of 
clone 81, and the polypeptide sequence encoded within clone 36. 

Fig. 6 shows the combined ORF of HCV cDNAs in clones 36 and 81, and the polypeptide encoded 
therein. 

Fig. 7 shows the HCV cDNA sequence in clone 32, the segment which overlaps clone 81, and the 
20 polypeptide encoded therein. 

Fig. 8 shows the HCV cDNA sequence in clone 35, the segment which overlaps clone 36, and the 
polypeptide encoded therein; 

Fig. 9 shows the combined ORF of HCV cDNAs in clones 35, 36, 81, and 32, and the polypeptide 
encoded therein. 

25 Fig. 10 shows the HCV cDNA sequence in clone 37b, the segment which overlaps clone 35, and the 
polypeptide encoded therein. 

Fig. 11 shows the HCV cDNA sequence in clone 33b, the segment which overlaps clone 32, and the 
polypeptide encoded therein. 

Fig. 12 shows the HCV cDNA sequence in clone 40b, the segment which overlaps clone 37b, and the 
30 polypeptide encoded therein. 

Fig. 13 shows the HCV cDNA sequence in clone 25c, the segment which overlaps clone 33b, and the 
polypeptide encoded therein. 

Fig. 14 shows the nucleotide sequence and polypeptide encoded therein of the ORF which; extends 
through the HCV cDNAs in clones 40b, 37b, 35, 36, 81, 32, 33b, and 25c. 
35 Fig. 1 5 shows the HCV cDNA sequence in clone 33c, the segment which overlaps clones 40b and 33c, 
and the amino acids encoded therein. 

Fig. 16 shows the HCV cDNA sequence in clone 8h, the segment which overlaps clone 33c, and the 
amino acids encoded therein. 

Fig. 17 shows the HCV cDNA sequence in clone 7e, the segment which overlaps clone 8h, and the 
40 amino acids encoded therein. 

Fig. 18 shows the HCV cDNA sequence in clone 14c, the segment which overlaps clone 25c, and the 
amino acids encoded therein. 

Fig. 19 shows the HCV cDNA sequence in clone 8f, the segment which overlaps done 14c, and the 
amino acids encoded therein: 
45 Fig. 20 shows the HCV cDNA sequence in clone 33f, the .segment which overlaps clone 8f, and the 
amino acids encoded therein. 

Fig. 21 shows the HCV cDNA sequence in clone 33g, the segment which overlaps clone 33f, and the 
amino acids encoded therein. • 

Fig. 22 shows the HCV cDNA sequence in clone 7f, the segment which overlaps the sequence in clone 
so 7e, and the amino acids encoded therein. 

Fig. 23 shows the HCV cDNA sequence in clone 11b, the segment which overlaps the sequence in 
clone 7f, arid the amino acids; encoded therein. 

Fig. 24 shows the HCV cDNA sequence in clone 14i, the segment which overlaps the sequence in clone 
11b, and the amino acids encoded therein. 
55 Fig. 25 shows the HCV cDNA sequence in clone 39c, the segment which overlaps the sequence in 
clone 33g, and the amino acids encoded therein. 

Fig. 26 shows a composite HCV cDNA sequence derived from the aligned cDNAs in clones 14i, 11b, 7f, 
7e, 8h, 33c 40b 37b 35 36, 81, 32, 33b, 25c, 14c, 8f, 33f, 33g and 39c also shown is the amino acid 
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sequence of the polypeptide encoded in the extended ORF in the derived sequence. 

Fig. 27 shows the sequence of the HCV cDNA in clone 12f, the segment which overlaps clone 14i, and 
the amino acids encoded therein. 

Fig. 28 shows the sequence of the HCV cDNA in clone 35f, the segment which overlaps clone 39c, and 
5 the amino acids encoded therein. 

Fig. 29 shows the sequence of the HCV cDNA in clone 19g, the segment which overlaps clone 35f, and 
the amino acids encoded therein. 

Fig. 30 shows the sequence of clone 26g, the segment which overlaps clone 19g, and the amino acids 
encoded therein. 

10 Fig. 31 shows the sequence of clone 15e, the segment which overlaps clone 26g, and the amino acids 
encoded therein. 

Fig. 32 shows the sequence in a composite cDNA, which was derived by aligning clones 12f through 
15e in the 5* to 3* direction; it also shows the amino acids encoded in the continuous ORF. 

Fig. 33 shows a photograph of Western blots of a fusion protein, SOD-NANBs-i-v, with chimpanzee 
75 serum from chimpanzees infected with BB-NANB, HAV, and HBV. 

Fig. 34 shows a photograph of Western blots of a fusion protein, SOD- NAN B5-1-1 , with serum from 
humans infected with NANBV, HAV, HBV, and from control humans. 

Fig. 35 is a map showing the significant features of the vector pAB24. 

Fig. 36 shows the putative amino acid sequence of the carboxy-terminus of the fusion polypeptide 
20 C1 00-3 and the nucleotide sequence encoding it. 

Fig. 37A is a photograph of a coomassie blue stained polyacrylamide gel which identifies C100-3 
expressed in yeast. 

Fig, 37B shows a Western blot of C100-3 with serum from a NANBV infected human. 

Fig, 38 shows an autoradiograph of a Northern blot of RNA isolated from the liver of a BB-NANBV 
25 infected chimpanzee, probed with BB-NANBV cDNA of clone 81. 

Fig. 39 shows an autoradiograph of NANBV nucleic acid treated with RNase A or DNase I, and probed 
with BB-NANBV cDNA of clone 81. 

Fig. 40 shows an autoradiograph of nucleic acids extracted from NANBV particles captured from 
infected plasma with anti-NANBs -1 -1 , and probed with ^P-labeled NANBV cDNA from clone 81 
30 Fig. 41a and b shows autoradiographs of filters containing isolated NANBV nucleic acids, probed with 
^P-labeled plus and minus strand DNA probes derived from NANBV cDNA in clone 81. 

Fig. 41-1 shows the homologies between a polypeptide encoded in HCV cDNA and an NS protein from 
Dengue flavivirus. 

Fig. 43 shows a histogram of the distribution of HCV infection in random samples, as determined by an 
35 ELISA screening. 

Fig. 44 shows a histogram of the distribution of HCV infection in random samples using two 
configurations of immunoglobulin-enzyme conjugate in an ELISA assay. 

Fig. 45 shows the sequences in a primer mix, derived from a conserved sequence in NS1 of 
flavivi ruses. 

40 Fig. 46 shows the HCV cDNA sequence in clone k9-1, the segment which overlaps the cDNA in Fig. 27, 
and the amino acids encoded therein. 

Fig. 47 shows the. sequence in a composite CDNA which was derived by aligning clones k9-1 through 
15e in the 5' to- 3' direction; it also shows the amino acids encoded in the continuous ORF. 

45 I. Definitions 

The term "hepatitis C virus", has been reserved by workers in the field for an heretofore unknown 
etiologic agent of NANBH. Accordingly, as used herein, "hepatitis C virus" (HCV) refers to an agent 
causitive of NANBH, which agent is a virus characterised by:(i) a positive stranded RNA genome; (ii) said 

so genome comprising an open reading frame (ORF) encoding a polyprotein; and (iii) the portion of said 
polyprotein corresponding to Figure 14 having at least 40% homology to the amino acid sequence in Figure 
14. This agent was formerly referred to as NANBV and/or BB-NANBV. The terms HCV, NANBV, and BB- 
NANBV are used interchangeably herein, but all refer to the virus as defined above. As an extension of this 
terminology, the disease caused by HCV, formerly Called NANB hepatitis (NANBH), is called hepatitis C. 

55 The terms NANBH and hepatitis C may be used interchangeably herein. 

The term "HCV", as used herein, denotes a viral species which causes NANBH, and attenuated strains 
or defective interfering particles derived therefrom. As shown infra., the HCV genome is comprised of RNA. 
It is known that RNA containing viruses have relatively high rates of spontaneous mutation, i.e., reportedly 
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on the order of 10~ 3 to 10~ 4 per nucleotide (Fields & Knipe (1986)). Therefore, there are multiple strains 
within the HCV species described infra. The compositions and methods described herein, enable the 
propagation, identification, detection, and isolation of the various related strains. Moreover, they also allow 
the preparation of diagnostics and vaccines for the various strains, and have utility in screening procedures 

5 for anti-viral agents for pharmacologic use in that they inhibit replication of HCV. 

The information provided herein, although derived from one strain of HCV, hereinafter referred to as 
CDC/HCV1, is sufficient to allow a viral taxonomist to identify other strains which fall within the species. As 
described herein, we have discovered that HCV is a Flavivirus or Flavi-like virus. The morphology and 
composition of Flavivirus particles are known, and are discussed in Brinton (1986). Generally, with respect 

w to morphology, Flaviviruses contain a central nucleocapsid surrounded by a lipid bilayer. Virions are 
spherical and have a diameter of about 40-50 nm. Their cores are about 25-30 nm in diameter. Along the 
outer surface of the virion envelope are projections that are about 5-10 nm long with terminal knobs about 2 
nm in diameter. 

HCV encodes an epitope which is immunologically identifiable with an epitope in the HCV genome from 

75 which the cDNAs described herein are derived; preferably the epitope is encoded in a cDNA described 
herein. The epitope is unique to HCV when compared to other known Flaviviruses. The uniqueness of the 
epitope may be determined by its immunological reactivity with HCV and lack of immunological reactivity 
with other Flavivirus species. Methods for determining immunological reactivity are known in the art, for 
example, by radioimmunoassay, by Elisa assay, by hemagglutination, and several examples of suitable 

20 techniques for assays are provided herein. 

In addition to the above, the following parameters are applicable, either alone or in combination, in 
identifying a strain as HCV. Since HCV strains are evolutionary related, it is expected that the overall 
homology of the genomes at the nucleotide level will be 40% or greater, preferably 60% or greater, and 
even more preferably 80% or greater; and in addition that there will be corresponding contiguous 

25 sequences of at least about 13 nucleotides. The correspondence between the putative HCV strain genomic 
sequence and the CDC/HCV1 cDNA sequence can be determined by techniques known in the art. For 
example, they can be determined by a direct comparison of the sequence information of the polynucleotide 
from the putative HCV, and the HCV cDNA sequence(s) described herein. For example, also, they can be 
determined by hybridization of the polynucleotides under conditions which form stable duplexes between 

30 homologous regions (for example, those which would be used prior to Si digestion), followed by digestion 
with single stranded specific nuclease(s), followed by size determination of the digested fragments. 

Because of the evolutionary relationship of the strains of HCV, putative HCV strains are identifiable by 
their homology at the polypeptide level. Generally, HCV strains are more than 40% homologous, preferably 
more than 60% homologous, and even more preferably more than 80% homologous at the polypeptide 

35 level. The techniques for determining amino acid sequence homology are known in the art. For example, 
the amino acid sequence may be determined directly and compared to the sequences provided herein. For 
example also, the nucleotide sequence of the genomic material of the putative HCV may be determined 
(usually via a CDNA intermediate); the amino acid sequence encoded therein can be determined, and the 
corresponding regions compared. 

40 As used herein, a polynucleotide "derived from" a designated sequence, for example, the HCV cDNA, 
particularly those exemplified in the sequences of Figs. 1-47, or from an HCV genome, refers, to a 
polynucleotide sequence which is comprised of a sequence of approximately at least 6 nucleotides, is 
preferably at least 8 nucleotides, is more preferably at least 10-12 nucleotides, and even more preferably at 
least 15-20 nucleotides corresponding, i.e., homologous to or complementary to, a region of the designated 

45 nucleotide sequence. Preferably, the sequence of the region from which the polynucleotide is derived is 
homologous to or complementary to a sequence which is unique to an HCV genome. Whether or not a 
sequence is unique to the HCV genome can be determined by techniques known to those of skill in the art. 
For example, the sequence can be compared to sequences in databanks, e.g., Genebank, to determine 
whether it is present in the uninfected host or other organisms. The sequence can also be compared to the 

so known sequences of other viral agents, including those which are known to induce hepatitis, e.g., HAV, 
HBV, and HDV, and to other members of the Flaviviridae. The correspondence or non-correspondence of 
the derived sequence to other sequences can also be determined by hybridization under the appropriate 
stringency conditions. Hybridization techniques for determining the complementarity of nucleic acid se- 
quences are known in the art, and are discussed infra. See also, for example, Maniatis et al. (1982). In 

55 addition, mismatches of duplex polynucleotides formed by hybridization can be determined by known 
techniques, including for example, digestion with a nuclease such as S1 that specifically digests single- 
stranded areas in duplex polynucleotides. Regions from which typical DNA sequences may be "derived" 
include but are not limited to, for example, regions encoding specific epitopes, as well as non-transcribed 
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and/or non-translated regions. 

The derived polynucleotide is not necessarily physically derived from the nucleotide sequence shown, 
but may be generated In any manner, including for example, chemical synthesis or DNA replication or 
reverse transcription or transcription, which are based on the information provided by the sequence of 
s bases in the region(s) from which the polynucleotide is derived. In addition, combinations of regions 
corresponding to that of the designated sequence may be modified in ways known in the art to be 
consistent with an intended use. 

Similarly, a polypeptide or amino acid sequence derived from a designated nucleic acid sequence, for 
example, the sequences in Figs. 1-47, or from an HCV genome, refers to a polypeptide having an amino 
jo acid sequence identical to that of a polypeptide encoded in the sequence, or a portion thereof wherein the 
portion consists of at least 3-5 amino acids, and more preferably at least 8-10 amino acids, and even more 
preferably at least 11-15 amino acids, or which is immunologically identifiable with a polypeptide encoded 
in the sequence. 

A recombinant or derived polypeptide is not necessarily translated from a designated nucleic acid 
J5 sequence, for example, the sequences in Figs. 1-47, or from an HCV genome; it may be generated in any 
manner, including for example, chemical synthesis, or expression of a recombinant expression system, or 
isolation from mutated HCV. 

The term "recombinant polynucleotide" as used herein intends a polynucleotide of genomic, cDNA, 
semisynthetic, or synthetic origin which, by virtue of its origin or manipulation: (1) is not associated with all 
20 or a portion of the polynucleotide with which it is associated in nature or in the form of a library; and/or (2) is 
linked to a polynucleotide other than that to which it is linked in nature. 

The term "polynucleotide" as used herein refers to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides. This term refers only to the primary structure of the molecule. 
Thus, this term includes double- and single-stranded DNA, as well as double- and single stranded RNA It 
25 also includes modified, for example, by methylation and/or by capping, and unmodified forms of the 
polynucleotide. 

As used herein, the term "HCV containing a sequence corresponding to a CDNA" means that the HCV 
contains a polynucleotide sequence which is homologous to or complementary to a sequence in the 
designated DNA; the degree of homology or complementarity to the cDNA will be approximately 50% or 

30 greater, will preferably be at least about 70%, and even more preferably will be at least about 90%. The 
sequences which correspond will be at least about 70 nucleotides, preferably at least about 80 nucleotides, 
and even more preferably at least about 90 nucleotides in length. The correspondence between the HCV 
sequence and the cDNA can be determined by techniques known in the art, including, for example, a direct 
comparison of the sequenced material with the cDNAs described, or hybridization and digestion with single 

35 strand nucleases, followed by size determination of the digested fragments. 

Techniques for purifying viral polynucleotides from viral particles are known in the art, and include for 
example, disruption of the particle with a chaotropic agent, and separation of the polynucleotide(s) and 
polypeptides by ion-exchange chromatography, affinity chromatography, and sedimentation according to 
density. 

40 Techniques for purifying viral polypeptides are known in the art, and examples of these techniques are 
discussed infra. 

"Recombinant host cells",: "host cells", "cells", "cell lines", "cell cultures:, and other such terms 
: denoting microorganisms or higher eukaryotic cell lines cultured as unicellular entities; refer to cells which 
can be, or have been, used as recipients for recombinant vector or other transfer DNA, and include the 
45 progeny of the original cell which has been transfected. It is understood that the progeny of a single 
parental cell may not necessarily be completely identical in morphology or in genomic or total DNA 
complement as the original parent, due to accidental or deliberate mutation. Progeny of the parental: cell 
which are sufficiently similar to the parent to be characterized by the relevant property, such as the 
presence of a nucleotide sequence encoding a desired peptide, are included in the progeny intended by 
so this definition, and are covered by the above terms. 

A "replicon" is any genetic element, e.g., a plasmid, a chromosome, a virus, that behaves as an 
autonomous unit of polynucleotide replication within a cell; i.e., capable of replication under its own control. 

A "vector" is a replicon in which another polynucleotide segment is attached, so as to bring about the 
replication and/or expression of the attached segment. 
55 "Control sequence" refers to polynucleotide sequences which are necessary to effect the expression of 
coding sequences to which they are ligated. The nature of such control sequences differs depending upon 
the host organism; in prokaryotes, such control sequences generally include promoter, ribosomal binding 
site, and terminators; in eukaryotes, generally, such control sequences include promoters, terminators and, 
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in some instances, enhancers. The term "control sequences" is intended to include, at a minimum, all 
components whose presence is necessary for expression, and may also include additional components 
whose presence is advantageous, for example, leader sequences. 

"Operably linked" refers to a juxtaposition wherein the components so described are in a relationship 

5 permitting them to function in their intended manner. A control sequence "operably linked" to a coding 
sequence is ligated in such a way that expression of the coding sequence is achieved under conditions 
compatible with the control sequences. 

An "open reading frame" (ORF) is a region of a polynucleotide sequence which encodes a polypeptide;! 
this region may represent a portion of a coding sequence or a total coding sequence. 

70 A "coding sequence" is a polynucleotide sequence which is transcribed into mRNA and/or translated 
into a polypeptide when placed under the control of appropriate regulatory sequences. The boundaries of 
the coding sequence are determined by a translation start codon at the 5'-terminus and a translation stop 
codon at the 3'-terminus. A coding sequence can include, but is not limited to mRNA, cDNA, and 
recombinant polynucleotide sequences.. 

75 "Immunologically identifiable with/as" refers to the presence of epitope(s) and polypeptides(s) which are 
also present in and are unique to the designated polypeptide(s), usually HCV proteins. Immunological 
identity may be determined by antibody binding and/or competition in binding; these techniques are known 
to those of average skill in the art, and are also illustrated infra. The uniqueness of an epitope can also be 
determined by computer searches of known data banks, e.g. Genebank, for the polynucleotide sequences 

20 which encode the epitope, and by amino acid sequence comparisons with other known proteins. 

As used herein, "epitope" refers to an antigenic determinant of a polypeptide; an epitope could 
comprise 3 amino acids in a spatial conformation which is unique to the epitope, generally an epitope 
consists of at least 5 such amino acids, and more usually, consists of at least 8-10 such amino acids. 
Methods of determining the spatial conformation of amino acids are known in the art, and include, for 

25 example, x-ray crystallography and 2-dimensional nuclear magnetic resonance. 

A polypeptide is "immunologically reactive" with an antibody when it binds to an antibody due to 
antibody recognition of a specific epitope contained within the polypeptide. Immunological reactivity may be 
determined by antibody binding, more particularly by the kinetics of antibody binding, and/or by competition 
in binding using as competitor(s) a known polypeptide(s) containing an epitope against which the antibody 

30 is directed. The techniques for determining whether a polypeptide is immunologically reactive with an 
antibody are known in the art. 

As used herein, the term "immunogenic polypeptide containing an HCV epitope" includes naturally 
occurring HCV polypeptides or fragments thereof- as well as polypeptides prepared by other means, for 
example, chemical synthesis, or the expression of the polypeptide in a recombinant organism. 

35 The term "polypeptide" refers to a molecular chain of amino acids and does not refer to a specific 
length of the product; thus, peptides, oligopeptides, and proteins are included within the definition of 
polypeptide. This term also does not refer to post-expression modifications of the polypeptide, for example, 
glycosylations, acetylations, phosphorylations and the like. 

"Transformation", as used herein, refers to the insertion of an exogenous polynucleotide into a host cell, 

40 irrespective of the. method used for the insertion, for example, direct uptake, transduction, or f-mating. The . 
exogenous polynucleotide may be maintained /as a non-integrated vector, for example, a plasmid, or 
alternatively, may be integrated into the host genome. 

"Treatment" as used herein refers to prophylaxis and/or therapy. 

An "individual", as used herein, refers to vertebrates, particularly members of the mammalian species, 
45 and includes but is not limited to domestic animals, sports animals, primates, and humans. 

As used herein, the "plus strand" of a nucleic acid contains the sequence that encodes the polypeptide. . 
The "minus strand" contains a sequence which is complementary to that of the "plus strand". 

As used herein, a "positive stranded genome" of a virus is one in which the genome, whether RNA or 
DNA, is single-stranded and which encodes a viral polypeptide(s). Examples of positive stranded RNA 
so viruses include Togaviridae, Coronaviridae, Retroviridae, Picornaviridae, and Caliciviridae. Included also, are 
the Flavivjridae, which were formerly classified as Togaviradae. See Fields & Knipe (1986). 

As used herein, "antibody containing body component" refers to a component of an individual's body 
which is a source of the antibodies of interest. Antibody containing body components are known in the art, 
and include but are not limited to, for example, plasma, serum, spinal fluid, lymph fluid, the external 
55 sections of the respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, white blood cells, and 
myelomas. 

As used herein, "purified HCV" refers to a preparation of HCV which has been isolated from the cellular 
constituents with which the virus is normally associated, and from other types of viruses which may be 
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present in the infected tissue. The techniques for isolating viruses are known to those of skill in the art, and 
include, for example, centrifugation and affinity chromatography; a method of preparing purified HCV is 
discussed infra. 

5 It. Description of the Invention 

The practice of the present invention will employ, unless otherwise indicated, conventional techniques of 
molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. 
Such techniques are explained fully in the literature. See e.g., Maniatis, Fitsch & Sambrook, MOLECULAR 
w CLONING; A LABORATORY MANUAL (1982); DNA CLONING, VOLUMES I AND II (D.N Glover ed. 1985); 
OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984); NUCLEIC ACID HYBRIDIZATION (B.D. Hames & 
S.J. Higgins eds. 1984); 

TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J. Higgins eds. 1984); ANIMAL CELL CULTURE 
(R.I. Freshney ed. 1986); 

?5 IMMOBILIZED CELLS AND ENZYMES (IRL Press, 1986); B Perbal, A PRACTICAL GUIDE TO MOLECU- 
LAR CLONING (1984); 

the series, METHODS IN ENZYMOLOGY (Academic Press, Inc.); 

GENE TRANSFER VECTORS FOR MAMMALIAN CELLS (J.H. Miller and M.P. Calos eds. 1987, Cold 
Spring Harbor Laboratory), Methods in Enzymology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, 

20 eds., respectively), Mayer and Walker, eds. (1987), IMMUNOCHEMICAL METHODS IN CELL AND MOLEC- 
ULAR BIOLOGY (Academic Press, London), Scopes, (1987), PROTEIN PURIFICATION: PRINCIPLES AND 
PRACTICE, Second Edition (Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL IMMUNOLOGY, 
VOLUMES l-IV (D.M. Weir and C. C. Blackwell eds 1986). 

All patents, patent applications, and publications mentioned herein, both supra and infra, are hereby 

25 incorporated herein by reference. 

The useful materials and processes of the present invention are made possible by the provision of a 
family of closely homologous nucleotide sequences isolated from a cDNA library derived from nucleic acid 
sequences present in the plasma of an HCV infected chimpanzee. This family of nucleotide sequences is 
not of human or chimpanzee origin, since it hybridizes to neither human nor chimpanzee genomic DNA 

30 from uninfected individuals, since nucleotides of this family of sequences are present only in liver and 
plasma of chimpanzees with HCV infection, and since the sequence is not present in Genebank. In addition, 
the family of sequences shows no significant homology to sequences contained within the HBV genome. 

The sequence of one member of the family, contained within clone 5-1-1, has one continuous open 
reading frame (ORF) which encodes a polypeptide of approximately 50 amino acids. Sera from HCV 

35 infected humans contain antibodies which bind to this polypeptide, whereas sera from non-infected humans 
do not contain antibodies to this polypeptide. Finally, whereas the sera from uninfected chimpanzees do not 
contain antibodies to this polypeptide, the antibodies are induced in chimpanzees following acute NANBH 
infection. Moreover, antibodies to this polypeptide are not detected in chimps and humans infected with 
HAV and HBV. By these criteria the sequence is a cDNA to a viral sequence, wherein the virus causes or is 

40 associated with NANBH; this cDN A sequence is shown in Fig. 1. As discussed infra, the cDNA sequence in 
clone5-1-1 differs from that of the other isolated cDNAs in that it contains 28 extra base pairs. 

A composite of other identified members of the cDNA family, which were isolated using as a probe a 
synthetic sequence equivalent to a fragment of the cDNA in clone 5-1-1, is shown in Fig. 3. A member of 
the cDNA family which was isolated using a synthetic sequence derived from the cDNA in clone 81 is 

45 shown in Fig. 5, and the composite of this sequence with that of clone 81 is shown in Fig. 6. Other 
members of the cDNA family, including those present in clones 12f, 14i, 11b, 7f, 7e, 8h, 33c, 40b, 37b, 35, 
36, 81, 32, 33b, 25c, 14c, 8f, 33f, 33g, 39c, 35f, 19g, 26g and 15e are described in Section IV. A. A 
composite of the cDNAs in these clones is described in Section IV.A. 19, and shown in Fig. 32. The 
composite cDNA shows that it contains one continuous ORF, and thus encodes a polyprotein. This data is 

so consistent with the suggestion, discussed infra., that HCV is a flavivirus or flavi-like virus. Clone k9-1 
overlaps the sequence of Fig. 32. A composite cDNA is shown in Fig. 47. 

The availability of this family of cDNAs shown in Figs. 1-47, inclusive, permits the construction of DNA 
probes and polypeptides useful in diagnosing NANBH due to HCV infection and in screening blood donors 
as well as donated blood and blood products for infection. For example, from the sequences it is possible to 

55 synthesize DNA oligomers of about 8-10 nucleotides, or larger, which are useful as hybridization probes to 
detect the presence of the viral genome in, for example, sera of subjects suspected of harboring the virus, 
or for screening donated blood for the presence of the virus. The family of cDNA sequences also allows the 
design and production of HCV specific polypeptides which are useful as diagnostic reagents for the 
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presence of antibodies raised during NANBH. Antibodies to purified polypeptides derived from the cDNAs 
may also be used to detect viral antigens in infected individuals and in blood. 

Knowledge of these cDNA sequences also enable the design and production of polypeptides which may 
be used as vaccines against HCV and also for the production of antibodies, which in turn may be used for 

5 protection against the disease, and/or for therapy of HCV infected individuals. 

Moreover, the family of cDNA sequences enables further" characterization of the HCV genome. 
Polynucleotide probes derived from these sequences may be used to screen cDNA libraries for additional 
overlapping cDNA sequences, which, in turn, may be used to obtain more overlapping sequences/Unless 
the genome is segmented and the segments lack common sequences, this technique may be used to gain 

w the sequence of the entire genome. However, if the genome is segmented, other segments of the genome 
can be obtained by repeating the Iambda-gt11 serological screening procedure used to isolate the cDNA 
clones described herein, or alternatively by isolating the genome from purified HCV particles. 

The family of cDNA sequences and the polypeptides derived from these sequences, as well as 
antibodies directed against these polypeptides are also useful in the isolation and identification of the BB- 

75 NANBV agent(s). For example, antibodies directed against HCV epitopes contained in polypeptides derived 
from the cDNAs may be used in processes based upon affinity chromatography to isolate the virus.; 
Alternatively, the antibodies may be used to identify viral particles isolated by other techniques. The viral 
antigens and the genomic material within the isolated viral particles may then be further characterized. 

The information obtained from further sequencing of the HCV genome(s), as well as from further 

20 characterization of the HCV antigens and characterization of the genome enables the design and synthesis 
of additional probes and polypeptides and antibodies which may be used for diagnosis, for prevention, and 
for therapy of HCV induced NANBH, and for screening for infected blood and blood-related products. 

The availability of probes for HCV, including antigens and antibodies, and polynucleotides derived from 
the genome from which the family of cDNAs is derived also allows for the development of tissue culture 

25 systems which will be of major use in elucidating the biology of HCV, This in turn, may lead to the 
development of new treatment regimens based upon antiviral compounds which preferentially inhibit the 
replication of, or infection by HCV. 

The method used to identify and isolate the etiologic agent for NANBH is novel, and it may be 
applicable to the identification and/or isolation of heretofore uncharacterized agents which contain a 

30 genome, and which are associated with a variety of diseases, including those induced by viruses, viroids, 
bacteria, fungi and parasites. In this method, a cDNA library was created from the nucleic acids present in 
infected tissue from an infected individual. The library was created in a vector which allowed the expression 
of polypeptides encoded in the cDNA. Clones of host cells containing the vector, which expressed an 
immunologically reactive fragment of a polypeptide of the etiologic agent, were selected by immunological 

35 screening of the expression products of the library with an antibody containing body component from 
another individual previously infected with the putative agent. The steps in the immunological screening 
technique included interacting the expression products of the cDNA containing vectors with the antibody 
containing body component of a second infected, individual, and detecting the formation of antibody-antigen 
complexes between the expression product(s) and antibodies of the second infected individual. The isolated ; 

40 clones are screened further immunologically by interacting their expression products with the antibody 
containing body components of other individuals infected with the putative agent and with control individuals 
uninfected with the putative agent, and detecting the formation of antigen-antibody complexes with 
antibodies from the infected individuals; and the: cDNA containing vectors which encode polypeptides which 
react immunologically with antibodies from infected individuals and individuals suspected of being infected ; 

45 with the agent, but not with control individuals are isolated. The infected individuals used for the 
construction of the cDNA library, and for the immunological screening need not be of the same species.. 

The cDNAs isolated as a result of this method, and their expression products, and antibodies directed 
against the expression products, are useful in characterizing and/or capturing the etiologic agent. As 
described in more detail infra, this method has been used successfully to isolate a family of cDNAs derived 

so from the HCV genome. 

II A. Preparation of the cDNA Sequence 

Pooled serum from a chimpanzee with chronic HCV infection and containing a high titer of the virus,,; 
55 i.e., at least 1.0* chimp infectious doses/ml (CID/ml) was used to isolate viral particles; nucleic acids isolated - 
from these particles was used as the template in the construction of a cDNA library to the viral genome. 
The procedures for isolation of putative HCV particles and for constructing the cDNA library in Iambda-gt1 1 
is discussed in Section IV.A.1. Lambda-gttl is a vector that has been developed specifically to express . 
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inserted cDNAs as fusion polypeptides with beta-galactosidase and to screen large numbers of recombinant 
phage with specific antisera raised against a defined antigen. The Iambda-gt1 1 cDNA library generated from 
a cDNA pool containing cDNA of approximate mean size of 200 base pairs was screened for encoded 
epitopes that could bind specifically with sera derived from patients who had previously experienced NANB 

s hepatitis. Huynh, T.V. et al. (1985). Approximately 10 6 phages were screened, and five positive phages were 
identified, purified, and then tested for specificity of binding to sera from different humans and chimpanzees 
previously infected with the HCV agent. One of the phages, 5-1-1, bound 5 of the 8 human sera tested. This 
binding appeared selective for sera derived from patients with prior NANB hepatitis infections since 7 
normal blood donor sera did not exhibit such binding. 

10 The sequence of the cDNA in recombinant phage 5-1-1 was determined, and is shown in Fig. 1. The 
polypeptide encoded by this cloned cDNA, which is in the same translations frame as the N-terminal beta- 
Galactosidase moiety of the fusion polypeptide is shown above the nucleotide sequence. This transnational 
ORF, therefore, encodes an epitope(s) specifically recognized by sera from patients with NANB hepatitis 
infections. 

15 The availability of the cDNA in recombinant phage 5-1-1 has allowed for the isolation of other clones 
containing additional segments and/or alternative segments of cDNA to the viral genome. The iambda-gt11 
cDNA library described supra, was screened using a synthetic polynucleotide derived from the sequence of 
the cloned 5-1-1 cDNA. This screening yielded three other clones, which were identified as 81, 1-2 and 91; 
the cDNAs contained within these clones were sequenced. See Sections IV.A.3. and IV.A.4. The homologies 

20 between the four independent clones are shown in Fig. 2, where the homologies are indicated by the 
vertical lines. Sequences of nucleotides present uniquely in clones 5-1-1, 81, and 91 are indicated by small 
letters. 

The cloned cDNAs present in recombinant phages in clones 5-1-1, 81, 1-2, and 91 are highly 
homologous, and differ in only two regions. First, nucleotide number 67 in clone 1-2 is a thymidine, whereas 
25 the other three clones contain a cytidine residue in this position. This substitution, however, does not alter 
the nature of the encoded amino acid. 

The second difference between the clones is that clone 5-1-1 contains 28 base pairs at its 5*-terminus 
which are not present in the other clones. The extra sequence may be a 5 , -terminal cloning artifact; 5'- 
terminal cloning artifacts are commonly observed in the products of cDNA methods. 
30 Synthetic sequences derived from the 5'-region and the 3'-region of the HCV cDNA in clone 81 were 
used to screen and isolate cDNAs from the Iambda-gt11 NANBV cDNA library, which overlapped clone 81 
cDNA (Section IVA5.). The sequences of the resulting cDNAs, which are in clone 36 and clone 32, 
respectively, are shown in Fig. 5 and Fig. 7. 

Similarly, a synthetic polynucleotide based on the 5*-region of clone 36 was used to screen and isolate 
35 cDNAs from the lambda gt-11 NANBV cDNA library which overlapped clone 36 cDNA (Section IV.A.8.). A 
purified clone of recombinant phage-containing cDNA which hybridized to the synthetic polynucleotide 
probe was named clone 35 and the NANBV cDNA sequence contained within this clone is shown in Fig. 8. 

By utilizing the technique of isolating overlapping cDNA sequences, clones containing additional 
upstream and downstream HCV cDNA sequences have been obtained. The isolation of these clones, is 
40 described infra in Section IV.A. 

Analysis of the nucleotide sequences of the HCV cDNAs encoded within the isolated clones show that 
the composite cDNA contains one long continuous ORF. Fig. 26 shows the sequence of the composite 
cDNA from these clones, along with the putative HCV polypeptide encoded therein. 

The description of the method to retrieve the cDNA sequences is mostly of historical interest. The 
45 resultant sequences (and their complements) are provided herein, and the sequences, or any portion 
thereof, could be prepared using synthetic methods, or by a combination of synthetic methods with retrieval 
of partial sequences using methods similar to those described herein. 

Lambda-gt11 strains replicated from the HCV cDNA library and from clones 5-1-1, 81, 1-2 and 91 have 
been deposited under the terms of the Budapest Treaty with the American Type Culture Collection (ATCC), 
so 12301 Parklawn Dr., Rockville, Maryland 20852, and have been assigned the following Accession Numbers. 
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lambda-gtn 

ATCC No. 

Deposit Date 

HCV cDNA library 

40394 

1 Dec. 1987 

clone 81 

40388 

17 Nov. 1987 

clone 91 

40389 

17 Nov. 1987 

clone 1-2 

40390 

17 Nov. 1987 

clone 5-1-1 

40391 

18 Nov. 1987 


70 


75 


The designated deposits will be maintained for a period of thirty (30) years from the date of deposit, or 
for five (5) years after the last request for the deposit; or for the enforceable life of the U.S. patent, 
whichever is longer. These deposits and other deposited materials mentioned herein are intended for 
convenience only, and are not required to practice the present invention in view of the description here. The 
HCV cDNA sequences in all of the deposited materials are incorporated herein by reference. 

The description above, of "walking" the genome by isolating overlapping cDNA sequences from the 
HCV lambda gt-11 library provides one method by which cDNAs corresponding to the entire HCV genome 
may be isolated. However, given the information provided herein, other methods for isolating these cDNAs 
are obvious to one of skill in the art. Some of these methods are described in Section IV.A., infra. 
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il.B. Preparation of Viral Polypeptides and Fragments 

The availability of cDNA sequences, either those isolated by utilizing the cDNA sequences in Figs. 1-32, 
as discussed infra, as well as the cDNA sequences in these figures, permits the construction of expression 
vectors encoding antigenically active regions of the polypeptide encoded in either strand. These antigeni- 
cally active regions may be derived from coat or envelope antigens or from core antigens, including, for 
example, polynucleotide binding proteins, polynucleotide polymerase(s), and other viral proteins required for 
the replication and/or assembly of the virus particle. Fragments encoding the desired polypeptides are 
derived from the cDNA clones using conventional restriction digestion or by synthetic methods, and are 
ligated into vectors which may, for example, contain portions of fusion sequences such as beta-Galac- 
tosidase or superoxide dismutase (SOD), preferably SOD. Methods and vectors which are useful for the 
production of polypeptides which contain fusion sequences of SOD are described in European Patent Office 
Publication number 0196056, published October 1, 1986. Vectors encoding fusion polypeptides of SOD and 
HCV polypeptides, i.e., NANB5-1-1, NANB 8 i, and C100-3, which is. encoded in a composite of HCV 
cDNAs, are described in Sections IV.B.1, IV.B.2, and IV.B.4, respectively. Any desired portion of the HCV 
cDNA containing an open reading frame, in either sense strand, can be obtained as a recombinant 
polypeptide, such as a mature or fusion protein; alternatively, a polypeptide encoded in the cDNA can be 
provided by chemical synthesis. 

The DNA encoding the desired polypeptide, whether in fused or mature form, and whether or not 
containing a signal sequence to permit secretion, may be ligated into expression vectors suitable for any 
convenient host. Both eukaryotic and prokaryotic host systems are presently used in forming recombinant 
polypeptides, and a summary of some of the more common control systems and host cell lines is given in 
Section III.A., infra. The polypeptide is then isolated from lysed cells or from the culture medium and 
purified to the extent needed for its intended use. Purification may be by techniques known in the art, for 
example, salt fractionation, chromatography on ion exchange resins, affinity chromatography, centrifugation, 
and the like. See, for example, Methods in Enzymology for a variety of methods for purifying proteins. Such 
polypeptides can be used as diagnostics, or those which give rise to neutralizing antibodies may be 
formulated into vaccines. Antibodies raised against these polypeptides can also be used as diagnostics, or 
for passive immunotherapy. In addition, as discussed in Section M.J. herein below, antibodies to these 
polypeptides are useful for isolating and identifying HCV particles. 

The HCV antigens may also be isolated from HCV virions. The virions may be grown in HCV infected 
cells in tissue culture, or in an infected host. 


I I.C. Preparation of Antigenic Polypeptides and Conjugation with Carrier 


55 


An antigenic region of a polypeptide is generally relatively small-typically 8 to 10 amino acids or less in 
length. Fragments of as few as 5 amino acids may characterize an antigenic region. These segments may 
correspond to regions of HCV antigen. Accordingly, using the cDNAs of HCV as a basis, DNAs encoding 
short segments of HCV polypeptides can be expressed recombinantly either as fusion proteins, or as 
isolated polypeptides. In addition, short amino acid sequences can be conveniently obtained by chemical 
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synthesis. In instances wherein the synthesized polypeptide is correctly configured so as to provide the 
correct epitope, but is too small to be immunogenic, the polypeptide may be linked to a suitable carrier. 

A number of techniques for obtaining such linkage are known in the art, including the formation of 
disulfide linkages using N-succinimidyl-3-(2-pyridylthio)propionate (SPDP) and succjnimidy! 4-(N-mal- 

5 eimidomethyl)cyclohexane-1-carboxylate (SMCC) obtained from Pierce Company, Rockford, Illinois, (if the 
peptide lacks a sulfhydryl group, this can be provided by addition of a cysteine residue.) These reagents 
create a disulfide linkage between themselves and; peptide cysteine residues on one protein and an amide 
linkage through the epsilonamino on a lysine, or other free amino group in the other. A variety of such 
disulfide/amide-forming agents are known. See, for example, Immun. Rev (1 982) 62:1 85. Other Afunctional 

10 coupling agents form a thioether rather than a disulfide linkage. Many of these thio-ether-forming agents are 
commercially available and include reactive esters of 6-maleimidocaproic acid, 2-bromoacetic acid, 2- 
iodoacetic acid, 4-(N-maleimidomethyl)cyclohexane-1-carboxylic acid, and the like. The carboxyl groups can 
be activated by combining them with succinimide or 1-hydroxyl-2-nitrq-4-sulfonic acid, sodium salt. The 
foregoing list is not meant to be exhaustive, and modifications of the named compounds can clearly; be 

75 used. ' ' .' 

Any carrier may be used which does not itself induce the production of antibodies harmful to the host. 
Suitable carriers are typically large, slowly metabolized macromolecules such as proteins; polysaccharides, 
such as latex functionalized sepharose, agarose, cellulose, cellulose beads and the like; polymeric amino 
acids, such as polyglutamic acid, polylysine, and the like; amino acid copolymers; and inactive virus 

20 particles, see, for example, section II.D. Especially useful protein substrates are serum albumins, keyhole 
limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid, and other proteins 
well known to those skilled in the art. 

II.D. Preparation of Hybrid Particle Immunpgens Containing HCV Epitopes, 

25 

The immunogenicity of the epitopes of HCV may also be enhanced by preparing them in mammalian or 
yeast systems fused with or assembled with particle-forming proteins such as, for example, that associated 
with hepatitis B surface antigen. Constructs wherein the NANBV epitope is linked directly to the particle- 
forming protein coding sequences produce hybrids which are immunogenic with respect to the HCV 

30 epitope. In addition, all of the vectors prepared include epitopes specific to HBV, having various degrees of 
immunogenicity, such as, for example, the pre-S peptide; Thus, particles constructed from particle forming 
protein which include HCV sequences are immunogenic with respect to HCV and HBV. 

Hepatitis surface antigen (HBSAg) has been shown to be formed and assembled into particles in S. 
cerevisiae (Valenzuela et al. (1982)), as well as in, for example, mammalian cells (Valenzuela, P., et al. 

35 (1984)). The formation of such particles has been shown to enhance the immunogenicity of the monomer 
subunit. The constructs may also include the immunodominant epitope of HBSAg, comprising the 55 amino 
acids of the presurface (pre-S) region. Neurath et al. (1984). Constructs of the pre^S-HBSAg particle 
expressible in yeast are disclosed in EPO 174,444, published March 19, 1986; hybrids including heterolo- 
gous viral sequences for yeast expression are disclosed in EPO 175,261, published March 26, 1966. Both 

40 applications are assigned to the herein assignee, and are incorporated herein by reference. These : : 
constructs may also be expressed in mammalian cells such as Chinese hamster ovary (CHO) cells using an 
SV40-dihydrofolate reductase vector (Michelle et al. (1984)). ] 

In addition, portions of the particle-forming protein coding sequence may be replaced with codons 
encoding an HCV epitope. In this replacement, regions which are not required to mediate the aggregation of ; 

45 ■ the units to form immunogenic particles in yeast or mammals can be deleted, thus eliminating additional :; 
HBV antigenic sites from competition with the HCV epitope. 

II. E. Preparation of Vaccines 

so Vaccines may be prepared from one or more immunogenic polypeptides derived from HCV cDNA as 
well as from the cDNA sequences in the Figs. 1-32, or from the HCV genome to which they correspond. 
The observed homology between HCV and Flaviviruses provides information concerning the polypeptides 
which are likely to be most effective as vaccines, as well as the regions of the genome in which they are 
encoded. The general structure of the Flavivirus genome is discussed in Rice et al (1986). The flavivirus 

55 genomic RNA is believed to be the only virus-specific mRNA species, and it is translated into the three viral 
structural proteins, i.e., C, M, and E, as well as two large nonstructural proteins, NV4 and NV5, and a 
complex set of smaller nonstructural proteins. It is known that major neutralizing epitopes for Flaviviruses 
reside in the E (envelope) protein (Roehrig (1986)). The corresponding HCV E gene and polypeptide 
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encoding region can be predicted, based upon the homology to Raviviruses. Thus, vaccines may be : 
comprised of recombinant polypeptides ; containing epitopes of HCV E. These polypeptides may be 
expressed in bacteria, yeast, or mammalian cells, or alternatively may be isolated from viral preparations. It 
is also anticipated that the other structural proteins may also contain epitopes which give rise to protective 

5 anti-HCV antibodies. Thus, polypeptides containing the epitopes of E, C, and M may also be used, whether 
singly or in combination, in HCV vaccines. 

In addition to the above, it has been shown that immunization with NS1 (nonstructural protein 1), results 
in protection against yellow fever (Schlesinger et al (1986)). This is true even though the immunization does 
not give rise to neutralizing antibodies. Thus, particularly since this protein appears to be highly conserved 

io among Raviviruses, it is likely that HCV NS1 will also be protective against HCV infection. Moreover, it also 
shows that nonstructural proteins may provide protection against viral pathogenicity, even if they do not 
cause the production of neutralizing antibodies. 

In view of the above, multivalent vaccines against HCV may be comprised of one or more structural 
proteins, and/or one or more nonstructural proteins. These vaccines may be comprised of, for example, 

is recombinant HCV polypeptides and/or polypeptides isolated from the virions. In addition, it may be possible 
to use inactivated HCV in vaccines; inactivation may be by the preparation of viral lysat.es,. or by other 
means known in the art to cause inactivation of Raviviruses, for example, treatment with organic solvents or 
detergents, or treatment with formalin. Moreover, vaccines may also be prepared from attenuated HCV 
strains. The preparation of attenuated HCV strains is described infra. 

20 It is known that some of the proteins in Raviviruses contain highly conserved regions, thus, some 
immunological cross-reactivity is expected between HCV and other Ravi viruses. It is possible that shared : 
epitopes between the Raviviruses and HCV will give rise to protective, antibodies against one or more of the 
disorders caused by these pathogenic agents. Thus, it may be possible to design multipurpose vaccines, 
based upon this knowledge. 

25 The preparation of vaccines which contain an immunogenic polypeptide(s) as active ingredients, is 
known to one skilled in the art. Typically, such vaccines are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection may : 
also be prepared. The preparation may also be emulsified, or the protein encapsulated in liposomes. The 
: active immunogenic ingredients are often mixed with excipients which are pharmaceutical^ acceptable and 

30 compatible with the active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol, 
ethanol, or the like and combinations thereof. In addition, if desired, the vaccine may contain minor amounts 
of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants which . 
enhance the effectiveness of the vaccine. Examples of adjuvants which may be effective include but are not 
limited to: aluminum hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor- 

35 muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP), N-acetylmuramylrL-alanyl-D-isoglut 
aminyl-L-alanine-a^l'^'-dipalmitoyl-snglycero-S-hydroxyphosphoryloxyVethylamine (CGP 19835A, referred 
to as MTP-PE), and RIBI, which contains three components extracted from bacteria, monophosphoryl lipid 
A, trehalose dimycolate and cell wall skeleton (MPL + TDM + CWS) in a 2% squalene/Tween 80 emulsion. 
The effectiveness of an adjuvant may be determined by measuring the amount of antibodies directed 

40 against an immunogenic polypeptide containing an HCV antigenic sequence resulting from administration of 
this polypeptide in vaccines which are also comprised of the various adjuvants. 

The vaccines are conventionally administered parenteral ly, by injection, for example, either sub- ■ 
; cutaneously or intramuscularly. Additional formulations which are suitable for other modes of administration 
include suppositories and, in some cases, oral formulations. For suppositories, traditional binders and ; 

45 carriers may include, for example, polyalkyiene glycols or triglycerides; such suppositories may be formed 
from mixtures containing the active ingredient in the range of 0.5% to 10%, preferably 1%-2%. Oral 
formulations include such normally employed excipients as, for example, pharmaceutical grades of man- : 
nitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like. 
These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release 

so formulations or powders and contain 10%-95% of active ingredient, preferably 25%-70%. 

The proteins may be formulated into the vaccine as neutral or salt: forms. Pharmaceutical^ acceptable 
salts include the acid addition salts {formed with free amino groups of the peptide) and which are formed 
with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids such as . 
acetic, oxalic, tartaric, maleic, and the like. Salts formed with the free carboxyl groups may also be derived 

55 from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, 
and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, nistidine, procaine, and 
the like. 
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ll.F. Dosage and Administration of Vaccines 

The vaccines are administered in a manner compatible with the dosage formulation, and in such amount 
as will be prophylactically and/or therapeutically effective. The quantity to be administered, which is 

5 generally in the range of 5 micrograms to 250 micrograms of antigen per dose, depends on the subject to 
be treated, capacity of the subject's immune system to synthesize antibodies, and the degree of protection 
desired. Precise amounts of active ingredient required to be administered may depend on the judgment of 
the practitioner and may be peculiar to each subject. 

The vaccine may be given in a single dose schedule, or preferably in a multiple dose schedule. A 

70 multiple dose schedule is one in which a primary course of vaccination may be with 1-10 separate doses, 
followed by other doses given at subsequent time intervals required to maintain and or reenforce the 
immune response, for example, at 1-4 months for a second dose, and if needed, a subsequent dose(s) after 
several months. The dosage regimen will also, at least in part, be determined by the need of the individual 
and be dependent upon the judgment of the practitioner. 

75 In addition, the vaccine containing the immunogenic HCV antigen(s) may be administered in conjunction 
with other immunoregulatory agents, for example, immune globulins. 

II.G. Preparation of Antibodies Against HCV Epitopes 

20 The immunogenic polypeptides prepared as described above are used to produce antibodies, both 
polyclonal and monoclonal. If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit, 
goat, horse, etc.) is immunized with an immunogenic polypeptide bearing an HCV epitope(s). Serum from 
the immunized animal is collected and treated according to known procedures. If serum containing 
polyclonal antibodies to an HCV epitope contains antibodies to other antigens, the polyclonal antibodies can 

25 be purified by immunoaffinity chromatography. Techniques for producing and processing polyclonal antisera 
are known in the art, see for example, Mayer and Walker (1 987). 

Alternatively, polyclonal antibodies may be isolated from a mammal which has been previously infected 
with HCV. An example of a method for purifying antibodies to HCV epitopes from serum from an infected 
individual, based upon affinity chromatography and utilizing a fusion polypeptide of SOD and a polypeptide 

30 encoded within cDNA clone 5-1-1, is presented in Section V.E. 

Monoclonal antibodies directed against HCV epitopes can also be readily produced by one skilled in the 
art. The general methodology for making monoclonal antibodies by hybridomas is well known. Immortal 
antibody-producing cell lines can be created by cell fusion, and also by other techniques such as direct 
transformation of B lymphocytes with oncogenic DNA, or transfection with Epstein-Barr virus. See, e.g., M. 

35 Schreier et al. (1980); Hammerling et al. (1981); Kennett et al. (1980); see also, U.S. Patent Nos. 4,341 ,761 ; 
4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491,632; and~4"^93,890. Panels of monoclonal 
antibodies produced against HCV epitopes can be screened for various properties; i.e., for isotype, epitope 
affinity, etc. 

Antibodies, both monoclonal and polyclonal, which are directed against HCV epitopes are particularly 
40 useful in diagnosis, and those which are neutralizing are useful in passive immunotherapy. Monoclonal 
antibodies, in particular, may be used to raise anti-idiotype antibodies. 

Anti-idiotype antibodies are immunoglobulins which carry an "internal image" of the antigen of the 
infectious agent against which protection is desired. See, for example, Nisonoff, A., et a!. (1981) and 
Dreesman et al. (1985). 

45 Techniques for raising anti-idiotype antibodies are known in the art. See, for example, Grzych (1985), 
MacNamara et al. (1984), and Uytdehaag et ai. (1985). These anti-idiotype antibodies may also be useful for 
treatment of NANBH, as well as for an elucidation of the immunogenic regions of HCV antigens. 

II.H. Diagnostic Oligonucleotide Probes and Kits 

50 

Using the disclosed portions of the isolated HCV cDNAs as a basis, including those in Figs. 1-32, 
oligomers of approximately 8 nucleotides or more can be prepared, either by excision or synthetically, 
which hybridize with the HCV genome and are useful in identification of the viral agent(s), further 
characterization of the viral genome(s), as well as in detection of the virus(es) in diseased individuals. The 
55 probes for HCV polynucleotides (natural or derived) are a length which allows the detection of unique viral 
sequences by , hybridization. While 6-8 nucleotides may be a workable length, sequences of 10-12 
nucleotides are preferred, and about 20 nucleotides appears optimal. Preferably, these sequences will 
derive from regions which lack heterogeneity. These probes can be prepared using routine methods, 


19 


EP 0 318 216 B1 


including automated oligonucleotide synthetic methods. Among useful probes, for example, are the clone 5- 
1-1 and the additional clones disclosed herein, as wel! as the various oligomers useful in probing cDNA 
libraries, set forth: below. A complement to any unique portion of the HCV genome will be satisfactory. For 
use as probes, complete complementarity is desirable, though it may be unnecessary as the length of the 
5 fragment is increased. 

For use of such probes as diagnostics, the biological sample to be analyzed, such as blood or serum, is 
treated, if desired, to extract the nucleic acids contained therein. The resulting nucleic acid from the sample 
may be subjected to gel electrophoresis or other size separation techniques; alternatively, the nucleic acid 
sample may be dot blotted without size separation. The probes, are then labeled. Suitable, jabels, and 
70 methods for labeling probes are known in the art, and include, for example, radioactive labels incorporated 
by nick translation or kinasing, biotin, fluorescent probes, and cherhiluminescent probes. The nucleic acids 
extracted from the sample are then treated with the labeled probe under hybridization conditions of suitable 
stringencies. 

The probes can be made completely complementary to the HCV genome. Therefore, usually high 

75 stringency conditions are desirable in order to prevent false positives. However, conditions of high 
stringency should only be used if the probes are complementary to regions of the viral genome which lack : 
heterogeneity. The stringency of hybridization is determined by a number of factors during hybridization 
and during the washing procedure, including temperature, ionic strength, length of time, and concentration 
of formamide. These factors are outlined in, for example, Maniatis, T. (1982). 

20 Generally, it is expected that the HCV genome sequences will be present in serum of infected 
individuals at relatively low levels, i.e., at approximately lOMO 3 sequences per ml. This level may require 
that amplification techniques be used in hybridization assays. Such techniques are known in the art. For 
example, the Enzo Biochemical Corporation "Bio-Bridge" system uses terminal deoxynucleotide transferase : 
to add unmodified 3'-poly-dT-tails to a DNA probe. The poly dT-tailed probe is hybridized to the target 

25 nucleotide sequence, and then to a biotin-modified poly-A. PCT application 84/03520 and [ EPA1 24221 
describe a DNA hybridization assay in which: (1) analyte is annealed to a single-stranded DNA probe that is 
complementary to an enzyme-labeled oligonucleotide; and (2) the resulting tailed duplex is hybridized to an 
enzyme-labeled oligonucleotide. EPA 204510 describes a DNA hybridization assay in which analyte DNA is 
contacted with a probe that has a tail, such as a poly-dT tail, an amplifier strand that has a sequence that 

30 hybridizes to the tail of the probe, such as a poly-A sequence, and which is capable of binding a plurality of 
labeled strands. A particularly desirable technique may first involve amplification of the target HCV 
sequences in sera approximately 10,000 fold, i.e., to approximately 10 6 sequences/ml. This may be 
accomplished, for example, by the technique of Saiki et al. (1986). The amplified sequence(s) may then be 
detected using a hybridization assay. A suitable solution phase sandwich assay which may be used with 

35 labeled polynucleotide probes, and the methods for the preparation of probes is described in EPO 225,807, 
published June 16, 1987, which is assigned to the herein assignee, and which is hereby incorporated herein 
by reference. 

The probes can be packaged into diagnostic kits. Diagnostic kits include the probe DNA, which may be 
labeled; alternatively, the probe DNA may be unlabeled and the ingredients for labeling may be included in ; 
40 the kit. The kit may also contain other suitably packaged reagents and materials needed for the particular 
hybridization protocol, for example, standards, as well as instructions for conducting the test. 

II I. Immunoassay and Diagnostic Kits 

45 Both the polypeptides which react immunologically with serum containing HCV antibodies,, for example, 
those derived from or encoded within the clones described in Section IV.A., and composites thereof, (see 
section IV.A.) arid the antibodies raised against the HCV specific epitopes in these polypeptides, see for 
example Section IV. E, are useful in immunoassays to detect presence of HCV antibodies, or the presence 
of the virus and/or viral antigens, in biological samples, including for example, blood or serum samples. : 

so Design of the immunoassays is subject to a great deal of variation, and a variety of these are known in the 
art. For example, the immunoassay may utilize one viral antigen, for example, a polypeptide derived from 
any of the clones containing HCV cDNA described in Section IV.A., or from the composite cDNAs derived 
from the cDNAs in these clones, or from the HCV genome from which the cDNA in these clones is derived; 
alternatively, the immunoassay may use a combination of viral antigens derived from these sources. It may 

55 use, for example, a monoclonal antibody directed towards a viral epitope(s), a combination of monoclonal 
antibodies directed towards one viral antigen, monoclonal antibodies directed towards different viral 
antigens, polyclonal antibodies directed towards the same viral antigen, or polyclonal antibodies directed 
towards different viral antigens. Protocols may be based, for example, upon competition, or direct reaction, 
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or sandwich type assays. Protocols may also, for example, use solid supports, or may be by im- 
munoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the labels may be, for 
example, fluorescent, chemiluminescent, radioactive, or dye molecules. Assays which amplify the signals 
from the probe are also known; examples of which are assays which utilize biotin and avidin, and enzyme- 

5 labeled and mediated immunoassays, such as ELISA assays. 

The Ravivirus model for HCV allows predictions regarding the likely location of diagnostic epitopes for 
the virion structural proteins. The C f pre-M, M, and E domains are all likely to contain epitopes of significant 
potential for detecting viral antigens, and particularly for diagnosis. Similarly, domains of the nonstructural 
proteins are expected to contain important diagnostic epitopes (e.g., NS5 encoding a putative polymerase; 

w and NS1 encoding a putative complement-binding antigen). Recombinant polypeptides, or viral polypep- 
tides, which include epitopes from these specific domains may be useful for the detection of viral antibodies 
in infections blood donors and infected patients. 

In addition, antibodies directed against the E and/or M proteins can be used in immunoassays for the 
detection of viral antigens in patients with HCV caused NANBH, and in infectious blood donors. Moreover, 

75 these antibodies will be extremely useful in detecting acute-phase donors and patients. 

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the polypeptides of the invention containing HCV epitopes or 
antibodies directed against HCV epitopes in suitable containers, along with the . remaining reagents and 
materials required for the conduct of the assay, as well as a suitable set of assay instructions. 

20 

IU. Further Characterization of the HCV Genome, Virions, and Viral Antigens Using Probes Derived From 
cDNA to the Viral Genome ~ ' ~ ~~ 

The HCV cDNA sequence information in the clones described in Section IVA, as shown in Figs. 1-32, 

25 inclusive, may be used to gain further information on the sequence of the HCV genome, and for 
identification and isolation of the HCV agent, and thus will aid in its characterization including the nature of 
the genome, the structure of the viral particle, and the nature of the; antigens of which it is composed. This 
information, in turn, can lead to additional polynucleotide probes, polypeptides derived from the HCV 
genome, and antibodies directed against HCV epitopes which would be useful for the diagnosis and/or 

30 treatment of HCV caused NANBH. 

The cDNA sequence information in the above-mentioned clones is useful for the design of probes for 
the isolation of additional cDNA sequences which are derived from as yet undefined regions of the HCV 
genome(s) from which the cDNAs in clones described in Section IV.A. are derived. For example, labeled 
probes containing a sequence of approximately 8 or more nucleotides, and preferably 20 or more 

35 nucleotides, which are derived from regions close to the 5' -termini or 3 f -termini of the family of HCV cDNA 
sequences shown in Figs. 1, 3, 6, 9, 14 and 32 may be used to isolate overlapping cDNA sequences from 
HCV cDNA libraries. These sequences which overlap the cDNAs in the above-mentioned clones, but which 
also contain sequences derived from regions of the genome from which the cDNA in the above mentioned 
clones are not derived, may then be used to synthesize probes for identification of other overlapping 

40 fragments which do not necessarily overlap the cDNAs in the clones described in Section IV.A. Unless the 
HCV genome is segmented and the segments lack common sequences, it is possible to sequence the 
entire viral genome(s) utilizing the technique of isolation of overlapping cDNAs derived from the viral 
genome(s). Although it is unlikely, if the genome is a segmented genome which lacks common sequences, 
the sequence of the genome can be determined by serologically screening Iambda-gt1 1 HCV cDNA 

45 libraries, as used to isolate clone 5-1-1, sequencing cDNA isolates, and using the isolated cDNAs to isolate 
overlapping fragments, using the technique described for the isolation and sequencing of the clones 
described in Section IV.A. Alternatively, characterization of the genomic segments could be from the viral 
genome(s) isolated from purified HCV particles. Methods for purifying HCV particles and for detecting them 
during the purification procedure are described herein, infra. Procedures for isolating polynucleotide 

so genomes from viral particles are known in the art, and one procedure which may be used is shown in 
Example IV.A.1. The isolated genomic segments could then be cloned and sequenced. Thus, with the 
information provided herein, it is possible to clone and sequence the HCV genome(s) irrespective of their 
nature. 

Methods for constructing cDNA libraries are known in the art, and are discussed supra and infra; a 
55 method for the construction of HCV cDNA libraries in Iambda-gt11 is discussed infra in Section IV.A. 
However, cDNA libraries which are useful for screening with nucleic acid probes may also be constructed in 
other vectors known in the art, for example, Iambda-gt10 (Huynh et al. (1985)). The HCV derived cDNA 
detected by the probes derived from the cDNAs in Figs. 1-32, and from the probes synthesized from 
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polynucleotides derived from these cDNAs, may be isolated from the clone by digestion of the isolated 
polynucleotide with the appropriate restriction enzyme(s), and sequenced. See, for example, Section IV.A.3. 
and IV.A.4. for the techniques used for the isolation and sequencing of HCV cDNA which overlaps HCV 
cDNA in clone 5-1-1, Sections IV.A.5-1V.A.7 for the isolation and sequencing of HCV cDNA which overlaps 

5 that in clone 81 , and Section IV.A.8 and IVA9 for the isolation and sequencing of a clone which overlaps 
another clone (clone 36), which overlaps clone 81. 

The sequence information derived from these overlapping HCV cDNAs is useful for determining areas 
of homology and heterogeneity within the viral genome(s), which could indicate the presence of different 
strains of the genome, and/or of populations of defective particles. It is also useful for the design of 

70 hybridization probes to detect HCV or HCV antigens or HCV nucleic acids in biological samples, and during 
the isolation of HCV (discussed infra), utilizing the techniques described in Section II. G. Moreover, the 
overlapping cDNAs may be used to create expression vectors for polypeptides derived from the HCV 
genome(s) which also encode the polypeptides encoded in clones 5-1-1, 36, 81, 91, and 1-2, and in the 
other clones described in Section IV.A. The techniques for the creation of these polypeptides containing 

75 HCV epitopes, and for antibodies directed against HCV epitopes contained within them, as well as their 
uses, are analogous to those described for polypeptides derived from NANBV cDNA sequences contained 
within clones 5-1-1, 32, 35, 36, 1-2, 81, and 91, discussed supra and infra. 

Encoded within the family of cDNA sequences contained within clones 5-1-1, 32, 35, 36, 81, 91, 1-2, 
and the other clones described in Section IV A are antigen(s) containing epitopes which appear to be 

20 unique to HCV; i.e., antibodies directed against these antigens are absent from individuals infected with 
HAV or HBV, and from individuals not infected with HCV (see the serological data presented in Section 
IV.B.). Moreover, a comparison of the sequence information of these cDNAs with the sequences of HAV, 
HBV, HDV, and with the genomic sequences in Genebank indicates that minimal homology exists between 
these cDNAs and the polynucleotide sequences of those sources. Thus, antibodies directed against the 

25 antigens encoded within the cDNAs of these clones may be used to identify BB-NANBV particles isolated 
from infected individuals. In addition, they are also useful for the isolation of NANBH agent(s). 

HCV particles may be isolated from the sera from BB-NANBV infected individuals or from cell cultures 
by any of the methods known in the art, including for example, techniques based on size discrimination 
such as sedimentation or exclusion methods, or techniques based on density such as ultracentrifugation in 

30 density gradients, or precipitation with agents such as polyethylene glycol, or chromatography on a variety 
of materials such as anionic or cationic exchange materials, and materials which bind due to hydrophobicity, 
as well as affinity columns. During the isolation procedure the presence of HCV may be detected by 
hybridization analysis of the extracted genome, using probes derived from the HCV cDNAs described 
supra, or by immunoassay (see Section II. I.) utilizing as probes antibodies directed against HCV antigens 

35 encoded within the family of cDNA sequences shown in Figs. 1-32, and also directed against HCV antigens 
encoded within the overlapping HCV cDNA sequences discussed supra. The antibodies may be mon- 
oclonal, or polyclonal, and it may be desirable to purify the antibodies before their use in the immunoassay. 
A purification procedure for polyclonal antibodies directed against antigen(s) encoded within clone 5-1-1 is 
described in Section IV.E; analogous purification procedures may be utilized for antibodies directed against 

40 other HCV antigens. 

Antibodies directed against HCV antigens encoded within the family of cDNAs shown in Figs. 1-32, as 
well as those encoded within overlapping HCV cDNAs, which are affixed to solid supports are useful for the 
isolation of HCV by immunoaffinity chromatography. Techniques for immunoaffinity chromatography are 
known in the art, including techniques for affixing antibodies to solid supports so that they retain their 

45 immunoselective activity; the techniques may be those in which the antibodies are adsorbed to the support 
(see, for example, Kurstak in ENZYME IMMUNODIAGNOSIS, page 31-37), as well as those in which the 
antibodies are covalently linked to the support. Generally, the techniques are similar to those used for 
covalent linking of antigens to a solid support, which are generally described in Section II. C; however, 
spacer groups may be included in the bifunctional coupling agents so that the antigen binding site of the 

so antibody remains accessible. 

During the purification procedure the presence of HCV may be detected and/or verified by nucleic acid 
hybridization, utilizing as probes polynucleotides derived from the family of HCV cDNA sequences shown in 
Figs. 1-32, as well as from overlapping HCV cDNA sequences, described supra. In this case, the fractions 
are treated under conditions which would cause the disruption of viral particles, for example, with detergents 

55 in the presence of chelating agents, and the presence of viral nucleic acid determined by hybridization 
techniques described in Section II.H. Further confirmation that the isolated particles are the agents which 
induce HCV may be obtained by infecting chimpanzees with the isolated virus particles, followed by a 
determination of whether the symptoms of NANBH result from the infection. 
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Viral particles from the purified preparations may then be further characterized. The genomic nucleic 
acid has been purified. Based upon its sensitivity to RNase, and not DNase I, it appears that the virus is 
composed of an RNA genome. See Example IV.C.2., infra. The strandedness and circularity or non- 
circularity can determined by techniques known in the art, including, for example, its visualization by 

5 electron microscopy, its migration in density gradients, and its sedimentation characteristics. Based upon 
the hybridization of the captured HCV genome to the negative strands of HCV cDNAs, it appears that HCV 
may be comprised of a positive stranded RNA genome (see Section IV.H.1). Techniques such as these are 
described in, for example, METHODS IN ENZYMOlOGY. In addition, the purified nucleic acid can be 
cloned and sequenced by known techniques, including reverse transcription since the genomic material is 

;o RNA. See, for example, Maniatis (1982), and Glover (1 985). Utilizing the nucleic acid derived from the viral 
particles, it is possible to sequence the entire genome, whether or not it is segmented. 

Examination of the homology of the polypeptide encoded within the continuous ORF of combined 
clones 14i through 39c (see Fig. 26), shows that the HCV polypeptide contains regions of homology with the: 
corresponding proteins; in conserved regions of flaviviruses. An example of this is described in Section 

75 IV.H.3. This finding has many important ramifications. First, this evidence, in conjunction with the results 
which show that HCV contains a positive-stranded genome, the size of which is approximately 10,000 
nucleotides, is consistent with the suggestion that HCV is a flavivirus, or flavi-like virus. Generally, flavivirus 
virions and their genomes have a relatively consistent structure and organization, which are known. See 
Rice et al. (1986), and Brinton, MA (1988). Thus, the structural genes encoding the polypeptides C, pre-. 

20 M/M, and E may be located in the 5'-terminus of the genome upstream of clone 14i. Moreover, using the 
comparison with other flaviviruses, predictions as to the precise location of the sequences encoding these 
proteins can be made. 

Isolation of the sequences upstream of those in clone 14i may be accomplished in a number of ways 
which, given the information herein, would be obvious to one of skill in the art. For example, the genome 

25 "walking" technique, may be used to isolate other sequences which are 5' to those in clone 14i, but which 
overlap that clone; this in turn leads to the isolation of additional sequences. This technique has been amply 
demonstrated infra, in Section IV.A.. For example, also, it is known that the flaviviruses have conserved 
epitopes and regions of conserved nucleic acid sequences. Polynucleotides containing the conserved 
sequences may be used as probes which bind the HCV genome, thus allowing its isolation. In addition, 

30 these conserved sequences, in conjunction with those derived from the HCV cDNAs shown in Fig. 22, may 
be used to design primers for use in systems which amplify the genome sequences upstream of those in 
clone 14i, using polymerase chain reaction technology. An example of this is described infra. 

The structure of the HCV may also be determined and its components isolated. The morphology and 
size may be determined by, for example, electron microscopy. The identification and localization of specific 

as viral polypeptide antigens such as coat or envelope antigens, or internal antigens, such as nucleic acid 
binding proteins, core antigens, and polynucleotide polymerase(s) may also be determined by, for example, 
determining whether the antigens are present as major or minor virai components, as well as by utilizing ■ 
antibodies directed against the specific antigens encoded within isolated cDNAs as probes. This information 
is useful in the design of vaccines; for example, it may be preferable to include an exterior antigen in a 

40 vaccine preparation. Multivalent vaccines may be comprised of, for example, a polypeptide derived from the 
genome encoding a structural protein, for example, E, as well as a polypeptide from another portion of the : 
genome, for example, a nonstructural or structural polypeptide. 

U.K. Cell Culture Systems and Animal Model Systems for HCV Replication 

45 ■■ • '■ • /,■;■ •. ' " -' 

The suggestion that HCV is a flavivirus or flavi-like virus also provides information on methods for; 
growing HCV. The term "flavi-like" means that the virus shows a significant amount of homology to the 
known conserved regions of flaviviruses and that the majority of the genome is a single ORF. Methods for 
culturing flaviviruses are known to those of skill in the art ( See , for example, the reviews by Brinton (1986) 

50 and Stollar, V. (1980)). Generally, suitable cells or cell linesTor cuituring HCV may include those known to 
support Flavivirus replication, for example, the following: monkey kidney cell lines (e.g. MK2, VERO); 
porcine kidney cell lines (e.g. PS); baby hamster kidney cell lines (e.g. BHK); murine macrophage cell lines 
(e.g., P388D1, MK1, Mm1); human macrophage cell lines (e.g., U-937); human peripheral blood leukocytes; 
human adherent monocytes; hepatocytes or hepatocyte cell lines (e.g., HUH7 , HEPG2); embryos or 

55 embryonic cells (e.g., chick embryo fibroblasts); or cell lines derived from invertebrates, preferably from 
insects (e.g. drosophila cell lines), or more preferably from arthropods, for example, mosquito cell lines 
(e.g., A. Albopictus, Aedes aegypti, Cutex tritaeniorhynchus) or tick cell lines (e.g. RML-14 Dermacentor 
parumapertus). 


23 


EP 0 318 216 B1 


It is possible that primary hepatocytes can be cultured, and then infected with! HCV; or alternatively, the 
hepatocyte cultures could be derived from the livers of infected individuals (e.g., humans or chimpanzees). 
The latter case is an example of a cell which is infected in vivo being passaged in vitro . In addition, various 
immortalization methods can be used to obtain cell-lines" derived from hepatocyte cultures. For example, 

5 primary liver cultures (before and after enrichment of the hepatocyte population) may be fused to a variety 
of cells to maintain stability. For example, also, cultures may be infected with transforming viruses, or 
transfected with transforming genes in order to create permanent or semipermanent cell lines. In addition, 
for example, cells in liver cultures may be fused to established cell lines (e.g., HepG2 ). Methods for cell 
fusion are known in the art, and include, for example, the use of fusion agents such as polyethylene glycol, 

10 Sendai Virus, and Epstein-Barr virus. 

As discussed above, HCV is a Flavivirus or Flavi-like virus. Therefore, it is probable that HCV infection 
of cell lines may be accomplished by techniques known in the art for infecting cells with Flaviviruses. These 
include, for example, incubating the cells with viral preparations under conditions which allow viral entry into 
the cell. In addition, it may be possible to obtain viral production by transfecting the cells with isolated viral 

75 polynucleotides. It is known that Togavirus and Flavivirus RNAs are infectious in a variety of vertebrate cell 
lines (Pfefferkorn and Shapiro (1974)), and in a mosquito cell line (Peleg (1969)). Methods for transfecting 
tissue culture cells with RNA duplexes, positive stranded RNAs, and DNAs (including cDNAs) are known in 
the art, and include, for example, techniques which use electroporation, and precipitation with DEAE-Dextran 
or calcium phosphate. An abundant source of HCV RNA can be obtained by performing in vitro transcription 

20 of an HCV cDNA corresponding to the complete genome; Transfection with this material, or with cloned 
HCV cDNA should result in viral replication and, the in vitro propagation of the virus. 

In addition to cultured cells, animal model systems may be used for viral replication; animal systems in 
which flaviviruses are known to those of skill in the art (See, for example, the review by Monath (1986)). 
Thus, HCV replication may occur not only in chimpanzees, but also in, for example, marmosets and'; 

25 suckling mice. 

ILL. Screening for Anti-Viral Agents for HCV 

The availability of cell culture and animal model systems for HCV also makes possible screening for 

30 anti-viral agents which inhibit HCV replication, and particularly for those agents which preferentially allow 
cell growth and multiplication while inhibiting viral replication. These screening methods are known by those 
of skill in the art. Generally, the anti-viral agents are tested at a variety of concentrations, for their effect on : 
preventing viral replication in cell culture systems which support viral replication, and then for an inhibition 
of infectivity or of viral pathogenicity (and a low level of toxicity) in an animal model system. 

35 The methods and compositions provided herein for detecting HCV antigens and HCV polynucleotides 
are useful for screening of anti-viral agents in that they provide an alternative, and perhaps more sensitive 
means, for detecting the agent's effect on viral replication than the cell plaque assay or ID50 assay. For 
example, the HCV-polynucleotide probes described herein may be used to quantitate the amount of viral ; 
nucleic acid produced in a cell culture. This could be accomplished, for example, by hybridization or : ; 

40 competition hybridization of the infected cell nucleic acids with a labeled HCV-polynucleotide probe. For 
example, also, antj-HCV antibodies may be used to identify and quantitate HCV antigen(s) in the cell culture : 
utilizing the immunoassays described herein. In addition, since it may be desirable to quantitate HCV 
antigens in the infected cell culture, by a competition assay, the polypeptides encoded within the HCV 
cDNAs described herein are useful in these competition assays. Generally, a recombinant HCV polypeptide 

45 derived from the HCV cDNA would be labeled/and the inhibition of binding of this labeled polypeptide to an 
HCV polypeptide due to the antigen produced in the cell culture system would be monitored. Moreover, 
t nese techniques are particularly useful in cases where the HCV may be able to replicate in a cell line 
without causing cell death. 

so II.M. Preparation of Attenuated Strains of HCV 

In addition to the above, utilizing the tissue culture systems and/or animal model systems, it may be 
possible to isolate attenuated strains of HCV. These strains would be suitable for vaccines, or for the 
isolation of viral antigens. Attenuated strains are isolatabte after multiple passages in cell culture and/or an 
55 animal model. Detection of an attenuated strain in an infected cell or individual is achievable by techniques 
known in the art, and could include, for example, the use of antibodies to one or more epitopes encoded in 
HCV as a probe or the use of a polynucleotide containing an HCV sequence of at least about 8 nucleotides 
as a probe. Alternatively, or in addition, an attenuated strain may be constructed utilizing the genomic 
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information of HCV provided herein, and utilizing recombinant techniques. Generally, one would attempt to 
delete a region of the genome encoding, for example, a polypeptide related to pathogenicity, but which 
allows viral replication. In addition, the genome construction would allow the expression of an epitope which 
gives rise to neutralizing antibodies for HCV. The altered genome could then be utilized to transform cells 
5 which allow HCV replication, and the cells grown under conditions to allow viral replication. Attenuated HCV 
strains are useful not only for vaccine purposes, but also as sources for the commercial production of viral 
antigens, since the processing of these viruses would require less stringent protection measures for the 
employees involved in viral production and/or the production of viral products. 

io III. General Methods 

The general techniques used in extracting the genome from a virus, preparing and probing a cDNA 
library, sequencing clones, constructing expression vectors, transforming cells, performing immunological 
assays such as radioimmunoassays and ELISA assays, for growing cells in culture, and the like are known 
75 in the art and laboratory manuals are available describing these techniques. However, as a general guide, 
the following sets forth some sources currently available for such procedures, and for materials useful in 
carrying them out. 

Ill A. Hosts and Expression Control Sequences 

20 

Both prokaryotic and eukaryotic host cells may be used for expression of desired coding sequences 
when appropriate control sequences which are compatible with the designated host are used. Among 
prokaryotic hosts, E. coli is most frequently used. Expression control sequences for prokaryotes include 
promoters, optionally containing operator portions, and ribosome binding sites. Transfer vectors compatible 

25 with prokaryotic hosts are commonly derived from, for example, pBR322, a plasmid containing operons 
conferring ampicillin and tetracycline resistance, and the various pUC vectors, which also contain sequences 
conferring antibiotic resistance markers. These markers may be used to obtain successful transformants by 
selection. Commonly used prokaryotic control sequences include the Beta-lactamase (penicillinase) and 
lactose promoter systems (Chang et al. (1977)), the tryptophan (trp) promoter system (Goedde! et al. 

30 (1980)) and the lambda-derived P L promoter and N gene ribosome binding site (Shimatake et al. (1981)) 
and the hybrid tac promoter (De Boer et al. (1983)) derived from sequences of the trp and lac UV5 
promoters. The foregoing systems are particularly compatible with E. coli; if desired, other prokaryotic hosts 
such as strains of Bacillus or Pseudomonas may be used, with corresponding control sequences. 

Eukaryotic hosts include yeast and mammalian cells in culture systems. Saccharomyces cerevisiae and 

35 Saccharomyces carlsbergensis are the most commonly used yeast hosts, and are convenient fungal hosts. 
Yeast compatible vectors carry markers which permit selection of successful transformants by conferring 
prototrophy to auxotrophic mutants or resistance to heavy metals on wild-type strains. Yeast compatible 
vectors may employ the 2 micron origin of replication (Broach et al. (1983)), the combination of CEN3 and 
ARS1 or other means for assuring replication, such as sequences which will result in incorporation of an 

40 appropriate fragment into the host cell genome. Control sequences for yeast vectors are known in the art 
and include promoters for the synthesis of glycolytic enzymes (Hess et al. (1968); Holland et al. (1978)), 
including the promoter for 3 phosphoglycerate kinase (Hitzeman (1980)). Terminators may also be included, 
such as those derived from the enolase gene (Holland (1981))! Particularly useful control systems are those 
which comprise the glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter or alcohol de- 

45 hydrogenase (ADH) regulatable promoter, terminators also derived from GAPDH, and if secretion is desired, 
leader sequence from yeast alpha factor. In addition, the transcriptional regulatory region and the transcrip- 
tional initiation region which are operably linked may be such that they are not naturally associated in the 
wild-type organism. These systems are described in detail in EPO 120,551, published October 3, 1984; 
EPO 116,201, published August 22, 1984; and EPO 164,556, published December 18, 1985, all of which are 

so assigned to the herein assignee, and are hereby incorporated herein by reference. 

Mammalian cell lines available as hosts for expression are known in the art and include many 
immortalized cell lines available from the American Type Culture Collection (ATCC), including HeLa cells, 
Chinese hamster ovary (CHO) cells, baby hamster kidney (BHK) cells, and a number, of other cell lines. 
Suitable promoters for mammalian cells are also known in the art and include viral promoters such as that 

55 from Simian Virus 40 (SV40) (Fiers (1978)), Rous sarcoma virus (RSV), adenovirus (ADV), and bovine 
papilloma virus (BPV). Mammalian cells may also require terminator sequences and poly A addition 
sequences; enhancer sequences which increase expression may also be included, and sequences which 
cause amplification of the gene may also be desirable. These sequences are known in the art. Vectors 
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suitable for replication in mammalian cells may include viral replicons, or sequences which insure 
integration of the appropriate sequences encoding NANBV epitopes into the host genome. 

III.B. Transformations 

5 

Transformation may be by any known method for introducing polynucleotides into a host cell, including, 
for example packaging the polynucleotide in a virus and transducing a host cell with the virus, and by direct 
uptake of the polynucleotide, the transformation procedure used depends upon the host to be transformed. 
For example, transformation of the E. coli host cells with Iambda-gt11 containing BB-NANBV sequences is 
w discussed in the Example section, infra. Bacterial transformation by direct uptake generally employs 
treatment with calcium or rubidium chloride (Cohen (1972); Maniatis (1982)). Yeast transformation by direct 
uptake may be carried out using the method of Hinnen et al. (1978). Mammalian transformations by direct 
uptake may be conducted using the calcium phosphate precipitation method of Graham and Van der Eb 
(1978), or the various known modifications thereof. 

15 

III.C. Vector Construction 

Vector construction employs techniques which are known in the art. Site-specific DNA cleavage is 
performed by treating with suitable restriction enzymes under conditions which generally are specified by 

20 the manufacturer of these commercially available enzymes. In general, about 1 microgram of plasmid or 
DNA sequence is cleaved by 1 unit of enzyme in about 20 microliters buffer solution by incubation of 1-2 hr 
at 37* C. After incubation with the restriction enzyme, protein is removed by phenol/chloroform extraction 
and the DNA recovered by precipitation with ethanoi. The cleaved fragments may be separated using 
polyacrylamide or agarose gel electrophoresis techniques, according to the general procedures found in 

25 Methods in Enzymology (1980) 65:499-560. 

Sticky ended cleavage fragments may be blunt ended using E. coli DNA polymerase I (Kienow) in the 
presence of the appropriate deoxynucleotide triphosphates (dNTPi) present in the mixture. Treatment with 
St nuclease may also be used, resulting in the hydrolysis of any single stranded DNA portions. 

Ligations are carried out using standard buffer and temperature conditions using T4 DNA ligase and 

30 ATP; sticky end ligations require less ATP and less ligase than blunt end ligations. When vector fragments 
are used as part of a ligation mixture, the vector fragment is often treated with bacterial alkaline 
phosphatase (BAP) or calf intestinal alkaline phosphatase to remove the 5'-phosphate and thus prevent , 
religatlon of the vector; alternatively, restriction enzyme digestion of unwanted fragments can. be used to 
prevent ligation. 

35 Ligation mixtures are transformed into suitable cloning hosts, such as E. coli, and successful transfor- 
mants selected by, for example, antibiotic resistance, and screened for the correct construction. : 

IH.D. Construction of Desired DNA Sequences 

40 Synthetic oligonucleotides may be prepared using an automated oligonucleotide synthesizer as de- 
scribed by Warner (1984). If desired the synthetic strands may be labeled with 32 P by treatment with 
polynucleotide kinase in the presence of 32 P-ATP, using standard conditions for the reaction; 

DNA sequences, including those isolated from cDNA libraries, may be modified by known techniques, 
including, for example site directed mutagenesis, as described by Zoller (1982). Briefly, the DNA to be 

45 modified is packaged into phage as a single stranded sequence, and converted to a doubie stranded DNA 
with DNA polymerase using, as a primer, a synthetic oligonucleotide complementary to the portion of the - 
DNA to be modified, and having the desired modification included in its own sequence. The resulting ;, 
double stranded DNA is transformed into a phage supporting host bacterium. Cultures of the transformed ; 
bacteria, which contain replications of each strand of the phage, are plated in agar to obtain plaques. 

so Theoretically, 50% of the new plaques contain phage having the mutated sequence, and the remaining 50% 
have the original sequence. Replicates of the plaques are hybridized to labeled synthetic probe at 
temperatures and conditions which permit hybridization with the correct strand, but not with the unmodified 
sequence. The sequences which have been identified by hybridization are recovered and cloned. 

55 III.E. Hybridization with Probe 

DNA libraries may be probed using the procedure of Grunstein and Hogness (1975). Briefly, in this 
procedure, the DNA to be probed is immobilized on nitro-cellulose filters, denatured, and prehybridized with 
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a buffer containing 0-50% formamide, 0.75 M NaCI, 75 mM Na citrate, 0.02% (wt/v) each of bovine serum 
albumin, polyvinyl pyrollidone, and Ficoll, 50 mM Na Phosphate (pH 6.5), 0.1% SDS, and 100 
microg rams/ml carrier denatured DNA. The percentage of formamide in the buffer, as well as the time and 
temperature conditions of the prehybridization and subsequent hybridization steps depends on the strin- 

s gency required. Oligomeric probes which require lower stringency conditions are generally used with low 
percentages of formamide, lower temperatures, and longer hybridization times. Probes containing more than 
30 or 40 nucleotides such as those derived from cDNA or genomic sequences generally employ higher 
temperatures, e.g., about 40-42 °C, and a high percentage, e.g., 50%, formamide. Following prehybridiza- 
tion, 5'- 32 P-labeled oligonucleotide probe is added to the buffer, and the filters are incubated in this mixture 

w under hybridization conditions. After washing, the treated filters are subjected to autoradiography to show 
the location of the hybridized probe; DNA in corresponding locations on the original agar plates is used as 
the source of the desired DNA. 

III. F. Verification of Construction and Sequencing 

For routine vector constructions, ligation mixtures are transformed into E. coli strain HB1 01 or other 
suitable host, and successful transformants selected by antibiotic resistance or other markers. Plasmids 
from the transformants are then prepared according to the method of Clewell et al, (1969), usually following 
chloramphenicol amplification (Clewell (1972)). The DNA is isolated and analyzed, usually by restriction 
20 enzyme analysis and/or sequencing. Sequencing may be by the dideoxy method of Sanger et al. (1977) as 
further described by Messing et al. (1981), or by the method of Maxam et al. (1980). Problems with band 
compression, which are sometimes observed in GC rich regions, were overcome by use of T- 
deazoguanosine according to Barr et al. (1986). 

25 III.G. Enzyme Linked Immunosorbent Assay 

The enzyme-linked immunosorbent assay (ELISA) can be used to measure either antigen or antibody 
concentrations. This method depends upon conjugation of an enzyme to either an antigen or an antibody, 
and uses the bound enzyme activity as a quantitative label.. To measure antibody, the known antigen is 

30 fixed to a solid phase (e.g., a microplate or plastic cup), incubated with test serum dilutions, washed, 
incubated with anti-immunoglobulin labeled with an enzyme, and washed again. Enzymes suitable for 
labeling are known in the art, and include, for example, horseradish peroxidase. Enzyme activity bound to 
the solid phase is measured by adding the specific substrate, and determining product formation or 
substrate utilization colorimetrically. The enzyme activity bound is a direct function of the amount of 

35 antibody bound. 

To measure antigen, a known specific antibody is fixed to the solid phase, the test material containing 
antigen is added, after an incubation the solid phase is washed, and a second enzyme-labeled antibody is 
added. After washing, substrate is added, and enzyme activity is estimated colorimetrically, and related to 
antigen concentration. 

IV. Examples 

Described below are examples of the present invention which are provided only for illustrative purposes, 
and not to limit the scope of the present invention. In light of the present disclosure, numerous embodi- 

45 ments within the scope of the claims will be apparent to those of ordinary skill in the art. The procedures 
set forth, for example, in Sections IV.A. may, if desired, be repeated but need not be, as techniques are 
available for construction of the desired nucleotide sequences based on the information provided by the 
invention. Expression is exemplified in E. coli; however, other systems are available as set forth more fully 
in Section III.A. Additional epitopes derivecTFbm the genomic structure may also be produced, and used to 

so generate antibodies as set forth below. 

IV.A. Preparation, Isolation and Sequencing of HCV cDNA 
IV.A.1. Preparation of HCV cDNA 

55 

The source of NANB agent was a plasma pool derived from a chimpanzee with chronic NANBH. The 
chimpanzee had been experimentally infected with blood from another chimpanzee with chronic NANBH 
resulting from infection with HCV in a contaminated batch of factor 8 concentrate derived from pooled 
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human sera. The chimpanzee plasma pool was made by combining many individual plasma samples 
containing high levels of alanine aminotransferase activity; this activity results from hepatic injury due to the 
HCV infection. Since 1 ml of a 10~ 6 dilution of this pooled serum given i.v. caused NANBH in another 
chimpanzee, its CID was at least 10 e /ml, i.e., it had a high infectious virus titer. 

5 A cDNA library from the high titer plasma pool was generated as follows. First, viral particles were 
isolated from the plasma; a 90 ml aliquot was diluted with 310 ml of a solution containing 50 mM Tris-HCI, 
pH 8.0, 1mM EDTA, 100 mM NaCI. Debris was removed by centrifugation for 20 min at 15,000 x g at 20*C. 
Viral particles in the resulting supernatant were then pelleted by centrifugation in a Beckman SW28 rotor at 
28,000 rpm for 5 hours at 20 • C. To release the viral genome, the particles were disrupted by suspending 

70 the pellets in 15 ml solution containing 1% sodium dodecyl sulfate (SDS), 10 mM EDTA, 10 mM Tris-HCI, 
pH 7.5, also containing 2 mg/ml proteinase k, followed by incubation at 45 • C for 90 min. Nucleic acids 
were isolated by adding 0.8 micrograms MS2 bacteriophage RNA as carrier, and extracting the mixture four 
times with a 1:1 mixture of phenohchloroform (phenol saturated with 0.5M Tris-HCI, pH 7.5, 0.1% (v/v) beta- 
mercaptoethanol, 0.1% (w/v) hydroxyquinolone, followed by extraction two times with chloroform. The 

75 aqueous phase was concentrated with 1-butanol prior to precipitation with 2.5 volumes absolute ethanol 
overnight at -20 • C. Nucleic acid was recovered by centrifugation in a Beckman SW41 rotor at 40,000 rpm 
for 90 min at 4 • C, and dissolved in water that had been treated with 0.05% (v/v) diethylpyrocarbonate and 
autoclaved. 

Nucleic acid obtained by the above procedure (<2 micrograms) was denatured with 17.5 mM 
20 CH 3 HgOH; cDNA was synthesized using this denatured nucleic acid as template, and was cloned into the 
EcoRI site of phage Iambda-gt11 using methods described by Huynh (1985), except that random primers 
replaced oligo(dT) 12-18 during the synthesis of the first cDNA strand by reverse transcriptase (Taylor et al. 
(1976)). The resulting double stranded cDNAs were fractionated according to size on a Sepharose CL-4B 
column; eluted material of approximate mean size 400, 300, 200, and 100 base-pairs were pooled into 
25 cDNA pools 1, 2, 3, and 4, respectively/The Iambda-gt11 cDNA library was generated from the cDNA in 
pool 3. 

The Iambda-gt11 cDNA library generated from pool 3 was screened for epitopes that could bind 
specifically with serum derived from a patient who had previously experienced NANBH. About 10 5 phage 
were screened with patient sera using the methods of Huynh et al. (1985), except that bound human 

30 antibody was detected with sheep anti-human Ig antisera that had been radio-labeled with 125 1. Five positive 
phages were identified and purified. The five positive phages were then tested for specificity of binding to 
sera from 8 different humans previously infected with the NANBH agent, using the same method. Four of 
the phage encoded a polypeptide that reacted immunologically with only one human serum, i.e., the one 
that was used for primary screening of the phage library. The fifth phage (5-1-1) encoded a polypeptide that 

35 reacted immunologically with 5 of 8 of the sera tested. Moreover, this polypeptide did not react im- 
munologically with sera from 7 normal blood donors. Therefore, it appears that clone 5-1-1 encodes a 
polypeptide which is specifically recognized immunologically by sera from NANB patients. 

IV.A.2. Sequences of the HCV cDNA in Recombinant Phage 5-1-1, and of the Polypeptide Encoded Within 
40 the Sequence. 

The cDNA in recombinant phage 5-1-1 was sequenced by the method of Sanger et al. (1977). 
Essentially, the cDNA was excised with EcoRI, isolated by size fractionation using gel electrophoresis. The 
EcoRI restriction fragments were subcloned into the M13 vectors, mp18 and mp19 (Messing (1983)) and 
45 sequenced using the dideoxychain termination method of Sanger et al. (1977). The sequence obtained is 
shown in Fig. 1 . 

The polypeptide encoded in Fig. 1 that is encoded in the HCV cDNA is in the same translational frame 
as the N -terminal beta-galactosidase moiety to which it is fused. As shown in Section IVA, the translational 
open reading frame (ORF) of 5-1-1 encodes epitope(s) specifically recognized by sera from patients and 
so chimpanzees with NANBH infections. 

IV.A.3. Isolation of Overlapping HCV cDNA to cDNA in Clone 5-1-1. . 

Overlapping HCV cDNA to the cDNA in clone 5-1-1 was obtained by screening the same Iambda-gt11 
55 library, created as described in Section IV.A.1,, with a synthetic polynucleotide derived from the sequence 
of the HCV cDNA in clones 5-1-1, as shown in Fig. 1. The sequence of the polynucleotide used for 
screening was: 
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5 ' -TCC CTT GCT CGA TGT ACG GTA AGT GCT GAG AGC 
ACT CTT CCA TCT CAT CGA ACT CTC GGT AGA GGA CTT CCC TGT 
5 CAG GT-3' • . 

The Iambda-gt1 1 library was screened with this probe, using the method described in Huynh (1985). 
Approximately 1 in 50,000 clones hybridized with the probe. Three clones which contained cDNAs which : 
io hybridized with the synthetic probe have been numbered 81 ,. 1-2, and 91 . 

IVA4. Nucleotide Sequences of Overlapping HCV cDNAs to cDNA in Clone 5-1-1. 

The nucleotide sequences of the three cDNAs in clones 81, 1-2, and 91 were determined essentially as 
75 in Section IV.A.2. The sequences of these clones relative to the HCV cDNA sequence in phage 5-i-1 is 
shown in Fig. 2, which shows the strand encoding the detected HCV epitope, and where the homologies in 
the nucleotide sequences are. indicated by vertical lines between the sequences. 

The sequences of the cloned HCV cDNAs are highly homologous in the overlapping regions (see Fig. : 
2). However, there are differences in two regions. Nucleotide 67 in clone 1-2 is a thymidine, whereas the 
20 other three clones contain a cytidine residue in this position. It should be noted, however, that the same 
amino acid is encoded when either C or T occupies this position. 

The second difference is that clone 5-1-1 contains 28 base pairs which are not present in the other 
three clones. These base pairs occur at the start of the cDNA sequence in 5-1-1, and are indicated by small 
'. letters. Based on radioimmunoassay data, which is discussed infra in Section IV.D., it is possible that an 
25 HCV epitope may be encoded in this 28 bp region. 

The absence of the 28 base pairs of 5-1-1 from clones 81. 1-2, and 91 may mean that the cDNA in 
these clones were derived from defective HCV genomes; alternatively, the 28 bp region could be a terminal 
artifact in clone 5-1-1. 

The sequences of small letters in the nucleotide sequence of clones 81 and 91 simply indicate that 
30 these sequences have not been found in other cDNAs because cDNAs overlapping these regions were not 
yet isolated. 

A composite HCV cDNA sequence derived from overlapping cDNAs in clones 5-1 -1 , 81 , 1 -2 and 91 is 
shown in Fig. 3. However, in this figure the unique 28 base pairs of clone 5-1-1 are omitted. The figure also 
shows the sequence of the polypeptide encoded within the ORF of the composite HCV cDNA. 

35 

IV.A.5. Isolation of Overlapping HCV cDNAs to cDNA in Clone 81 . 

The isolation of HCV cDNA sequences upstream of, and which overlap those in clone 81 cDNA was 
accomplished as follows. The Iambda-gt1 1 cDNA library prepared as described in Section IV. A. 1. was 
40 screened by hybridization with a synthetic polynucleotide probe which was homologous to a 5' terminal 
sequence of clone 81. The sequence of clone 81 is presented in Fig. 4. The: sequence of the synthetic 
polynucleotide used for screening was: 

5, CTG TCA GGT ATG ATT GCC GGC TTC CCG GAC 3\ 
. The methods were essentially as described in Huynh (1985), except that the library filters were given two 
45 washes under stringent conditions, i.e., the washes were in 5 x SSC, 0.1 % SDS at 55 • C for 30 minutes 
each. Approximately 1 in 50,000 clones hybridized with the probe. A positive recombinant phage, which 
contained cDNA which hybridized with the sequence was isolated and purified. This phage, has been 
numbered clone 36. 

Downstream cDNA sequences, which overlaps the carboxyl-end sequences in clone 81 cDNA were : 
so isolated using a procedure similar to that for the isolation of upstream cDNA sequences, except that a 
synthetic oligonucleotide probe was prepared which is homologous to a 3' terminal sequence of clone 81, 
The sequence of the synthetic polynucleotide used for screening was: 

5' TTT GGC TAG TGG TTA GTG GGC TGG TGA CAG 3' 
A positive recombinant phage, which contained cDNA which hybridized with this latter sequence was 
55 isolated and purified, and has been numbered clone 32. 
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IV.A.6. Nucleotide Sequence of HCV cDNA in Clone 36 . 

The nucleotide sequence of the cDNA in clone 36 was determined essentially as described in Section 
IV.A.2. The double-stranded sequence of this cDNA, its region of overlap with the HCV cDNA in clone 81, . 
5 and the polypeptide encoded by the ORF are shown in Fig. 5. 

The ORF in clone 36 is in the same translational frame as the HCV antigen encoded in clone 81. Thus, 
in combination, the ORFs in clones 36 and 81 encode a polypeptide that represents part of a large HCV 
antigen. The sequence of this putative HCV polypeptide and the double stranded DNA sequence encoding 
it, which is derived from the combined ORFs of the HCV cDNAs of clones 36 and 81 , is shown in Fig. 6. 

IV.A7 Nucleotide Sequences of HCV cDNA in Clone 32 

The nucleotide sequence of the cDNA in clone 32 was determined essentially as was that described in 
Section IV.A.2 for the sequence of clone 5-1-1. The sequence data indicated that the cDNA in clone 32 

75 recombinant phage was derived from two different sources. One fragment of the cDNA was comprised of 
418 nucleotides derived from the HCV genome; the other fragment was comprised of 172 nucleotides 
derived from the bacteriophage MS2 genome, which had been used: as a carrier during the preparation of 
the lambda gt11 plasma cDNA library. 

The sequence of the cDNA in clone 32 corresponding to that of the HCV genome is shown in Fig. 7. 

20 The region of the sequences that overlaps that of clone 81, and the polypeptide encoded by the ORF are 
also indicated in the figure. This sequence contains one continuous ORF that is in the same translational 
frame as the HCV antigen encoded by clone 81 . 

IY.A.8 Isolation of Overlapping HCV cDNA to cDNA in Clone 36 

25 ■■■ ' • . 

The isolation of HCV cDNA sequences upstream of, and which overlap those in clone 36 cDNA was 
accomplished as described in Section IV.A.5, for those which overlap clone 81 cDNA, except that the 
synthetic polynucleotide was based on the 5'-region of clone 36. The sequence of the synthetic poly- 
nucleotide used for screening was: : 
30 5' AAG CCA CCG TGT GCG CTA GGG CTC AAG CCC 3* 

Approximately 1 in 50,000 clones hybridized with the probe. The isolated, purified clone of recombinant 
phage which contained cDNA which hybridized to this sequence was named clone 35. 

IV.A.9 Nucleotide Sequence of HCV cDNA in Clone 35 
35 ' . 

The nucleotide sequence of the cDNA in clone 35 was determined essentially as described in Section 

IV.A.2. The sequence, its region of overlap with that of the cDNA in clone 36, and the putative polypeptide 

encoded therein, are shown in Fig. 8. 

Clone 35 apparently contains a single, continuous ORF that encodes a polypeptide in the same 
40 translational frame as that encoded by clone 36, clone 81, and clone 32. Fig. 9 shows the sequence of the 

long continuous ORF that extends through clones 35, 36, 81, and 32, along with the putative HCV 

polypeptide encoded therein. This combined sequence has been confirmed using other independent cDNA 

clones derived from the same lambda gt11 cDNA library. 

45 IV.A.1 0. Isolation of Overlapping HCV cDNA to cDNA in Clone 35 

The isolation of HCV cDNA sequences upstream of, and which overlap those in clone 35 cDNA was 
accomplished as described in Section IV.A.8, for those which overlap clone 36 cDNA, except that the : . 
synthetic polynucleotide was based on the S'-region of clone 35. the sequence of the synthetic poly- 
50 nucleotide used for screening was: 

5' CAG GAT GCT GTC TCC CGC ACT CAA CGT 3' 
Approximately 1 in 50,000 clones hybridized with the probe. The isolated, purified clone of recombinant 
phage which contained cDNA which hybridized to this sequence was named clone 37b. 

55 IV.A.1 1. Nucleotide Sequence of HCV in Clone 37b 

The nucleotide sequence of the cDNA in clone 37b was determined essentially as described in Section 
IV.A.2. The sequence, its region of overlap with that of the cDNA in clone 35, and the putative polypeptide 
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encoded therein, are shown in Fig. 10. 

The S'-terminal nucleotide of clone 35 is a T, whereas the corresponding nucleotide in clone 37b is an 
A. The cDNAs from three other independent clones which were isolated during the procedure in which 
clone 37b was isolated, described in Section IV.A.10, have also been sequenced. The cDNAs from these 
5 clones also contain an A in this position. Thus, the S'-terminal T in clone 35 may be an artefact of the 
cloning procedure. It is known that artefacts often arise at the 5'-termini of cDNA molecules. 

Clone 37b apparently contains one continuous ORF which encodes a polypeptide which is a continu- 
ation of the polypeptide encoded in the ORF which extends through the overlapping clones 35, 36, 81 and 
32. 

10 

IV.A.12 Isolation of Overlapping HCV cDNA to cDNA in Clone 32 

The isolation of HCV cDNA sequences downstream of clone 32 was accomplished as follows. First, 
clone cla was isolated utilizing a synthetic hybridization probe which was based on the nucleotide sequence 
75 of the HCV cDNA sequence in clone 32. The method was essentially that described in Section IV.A.5, 
except that the sequence of the synthetic probe was: 

5' AGT GCA GTG GAT GAA CCG GCT GAT AGC CTT 3\ 
Utilizing the nucleotide sequence from clone cla, another synthetic nucleotide was synthesized which had 
the sequence: 

20 5' TCC TGA GGC GAC TGC ACC AGT GGA TAA GCT 3*. 

Screening of the lambda gt1 1 library using the clone cla derived sequence as probe yielded approximately 
Tin 50,000 positive colonies. An isolated, purified clone which hybridized with this probe was named clone 
33b. 

25 IV.A.1 3 Nucleotide Sequence of HCV cDNA in Clone 33b 

The nucleotide sequence of the cDNA in clone 33b was determined essentially as described in Section 
IV.A.2. The sequence, its region of overlap with that of the cDNA in clone 32, and the putative polypeptide 
encoded therein, are shown in Fig. 11.. 
30 Clone 33b apparently contains one continuous ORF which is an extension of the ORFs in overlapping 
clones 37b, 35, 36, 81 and 32. The polypeptide encoded in clone 33b is in the same translational frame as 
that encoded in the extended ORF of these overlapping clones. 

IV. A. 1 4 Isolation of Overlapping HCV cDNAs to cDNA Clone 37b and to cDNA in Clone 33b 

35 

In order to isolate HCV.cDNAs which overlap the cDNAs in clone 37b and in clone 33b, the following 
synthetic oligonucleotide probes, which were derived from the cDNAs in those clones, were used to screen 
the lambda gt1 1 library, using essentially the method described in Section IV.A.3. The probes used were: 

40 

5' CAG GAT GCT GTC TCC CGC ACT CAA CGT C 3 ' 

and 

45 

5' TCC TGA GGC GAC TGC ACC AGT GGA TAA GCT 3' 

to detect colonies containing HCV cDNA sequences which overlap those in clones 37b and 33b, respec- 
50 ttvely. Approximately 1 in 50,000 colonies were detected with each probe. A clone which contained cDNA 
which was upstream of, and which overlapped the cDNA in clone 37b, was named clone 40b. A clone which 
contained cDNA which was downstream of, and which overlapped the cDNA in clone 33b was named clone 
25c. 

55 IV.A.15 Nucleotide Sequences of HCV cDNA in clone 40b and in clone 25c 

The nucleotide sequences of the cDNAs in clone 40b and in clone 25c were determined essentially as 
described in Section IV.A.2. The sequences of 40b and 25c, their regions of overlap with the cDNAs in 
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clones 37b and 33b, and the putative polypeptides encoded therein, are shown in Fig. 12 (clone 40b) and 
Fig. 13 (clone 25c). 

The 5'-terminal nucleotide of clone 40b is a G. However, trie cDNAs from five other independent clones 
which were isolated during the procedure in which clone 40b was isolated, described in Section IVA14, 
5 have also been sequenced. The cDNAs from these clones also contain a T in this position. Thus, the G may 
represent a cloning artifact (see the discussion in Section IV.A.11). 

The 5 f -terminus of clone 25c is ACT, but the sequence of this region in clone cla (sequence not 
shown), and in clone 33b is TCA. This, difference may also represent a cloning artifact, as may the 28 extra 
5Merminal nucleotides in clone 5-1-1. 
10 Clones 40b and 25c each apparently contain an ORF which is an extension of the continuous ORF. in 
the previously sequenced clones. The nucleotide sequence of the ORF extending through clones 40b, 37b, 
35, 36, 81, 32, 33b, and 25c, and the amino acid sequence of the putative polypeptide encoded therein, are 
shown in Fig. 14. In the figure, the potential artifacts have been omitted from the sequence, and instead, the 
corresponding sequences in non-S'-terminal regions of multiple overlapping clones are shown. 

75 

IV.A.16. Preparation of a Composite HCV cDNA from the cDNAs in Clones 36, 81, and 32 

The composite HCV cDNA, C100, was constructed as follows. First the cDNAs from the clones 36, 81, 
and 32 were excised with EcoRI. The EcoRI fragment of cDNA from each clone was cloned individually into 

20 the EcoRI site of the vector pGEM3-blue (Promega Biotec). The resulting recombinant vectors which : 
contained the cDNAs from clones 36, 81, and 32 were named pGEM3-blue/36, pGEM3-blue/81, and 
pGEM3-blue/32, respectively. The appropriately oriented recombinant of pGEM3-blue/81 was digested with 
Nael and Narl, and the large (~2850bp) fragment was purified and ligated with the small (~570bp) Nael/Narl ; . 
purified restriction fragment from pGEM3-blue/36, This composite of the cDNAs from clones 36 and 81 was 

25 used to generate another pGEM3-biue vector containing the continuous HCV ORF contained Within the 
overlapping cDNA within these clones. This new plasmid was then digested with Pvull and EcoRI to release 
a fragment of approximately 680bp, which was then ligated with the small (580bp) Pvu|l/EcoRI fragment 
isolated from the appropriately oriented pGEM3-blue/32 plasmid, and the composite cDNA from clones 36,. 
. 81, and 32 was ligated into the EcoRI linearized vector pSODcfl, which: is described in Section IV.B.1, and 

30 which was used to express clone 5-1-1 in bacteria. Recombinants containing the ~1270bp EcoRI fragment of 
composite HCV cDNA (C100) were selected, and the cDNA from the plasmids was excised with EcoRI and 
purified. 

IV.A.17. Isolation and Nucleotide Sequences of HCV cDNAs in Clones 14i, 11b, 7f, 7e, 8h, 33c, 14c, 8f, 33f, 
35 33g, and 39c - : : 

The HCV cDNAs in clones 14i, 11b, 7f, 7e, 8h, 33c, 14c, 8f, 33f, 33g, and 39c were isolated by the 
technique of isolating overlapping cDNA fragments from the lambda gt1 1 library of HCV cDNAs described 
in Section IV.A.1.. The technique used was essentially as described in Section IV.A.3., except that the 
40 probes used were designed from the nucleotide sequence of the last isolated clones from the 5' and the 3' 
end of the combined HCV sequences. The frequency of clones which hybridized with the probes described 
below was approximately 1 in 50,000 in each case. 

The nucleotide sequences of the HCV cDNAs in clones 14i, 7f, 7e, 8h, 33c, 14c, 8f, 33f, 33g, and 39c 
were determined essentially as described in Section 1V.A.2., except that the cDNA excised from these 
45 phages were substituted for the cDNA isolated from clone 5-1-1. 

Clone 33c was isolated using a hybridization probe based on the sequence of nucleotides in clone 40b. 
The nucleotide sequence of clone 40b is presented in Fig. 12. The nucleotide sequence of the probe used 
to isolate 33c was: 

5' ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT 3* 
so The sequence of the HCV cDNA in clone 33c, and the overlap with that in clone 40b, is shown in Fig. 15, 
which also shows the amino acids encoded therein. 

Clone 8h was isolated using a probe based on the sequence of nucleotides in clone 33c. The nucleotide 
sequence of the probe was 

5' AGA GAC AAC CAT GAG GTC CCC GGT GTT C 3'. 
55 The sequence of the HCV cDNA in clone 8h, and the overlap with that in clone 33c, and the amino acids 
encoded therein, are shown in Fig. 16. 

Clone 7e was isolated using a probe based on the sequence of nucleotides in clone 8h. The nucleotide 
sequence of the probe was 
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5' TCG GAC CTT TAC CTG GTC ACG AGG CAC 3\ 
The sequence of HCV cDNA in clone 7e, the overlap with clone 8h, and the amino acids encoded therein* 
are shown in Fig. 17. 

Clone 14c was isolated with a probe based on the sequence of nucleotides in elope 25c. The sequence 
5 of clone 25c is shown in Fig. 13. The probe in the isolation of clone 14c had the sequence 

5' ACC TTC CCC ATT AAT GCC TAC ACC ACG GGC 3 1 . 
The sequence of HCV cDNA in clone 14c, its overlap with that in clone 25c, and the amino acids encoded 
therein are shown in Fig. 18. 

Clone 8f was isolated using a probe based on the sequence of nucleotides in clone 14c. The nucleotide 
10 sequence of the probe was 

5' TCC ATC TCT CAA GGC AAC TTG CAC CGC TAA 3'. 
The sequence of HCV cDNA in clone 8f, its overlap with that in clone 14c, and the amino acids encoded 
therein are shown in Fig. 19. 

Clone 33f was isolated using a probe based on the nucleotide sequence present in clone 8f. The 
ts nucleotide sequence of the probe was 

5, TCC ATG GCT GTC CGC TTC CAC CTC CAA AGT 3\ 
The sequence of HCV cDNA in clone 33f, its overlap with that in clone 8f, and the amino acids encoded 
therein are shown in Fig. 20. 

Clone 33g was isolated using a probe based on the sequence of nucleotides in clone 33f. The 
20 nucleotide sequence of the probe was 

5' GCG ACA ATA CGA CAA CAT CCT CTG AGC CCG 3\ 
The sequence of HCV cDNA in clone 33g, its overlap with that in clone 33f, and the amino acids encoded 
therein are shown in Fig. 21. 

Clone 7f was isolated using a probe based on the sequence of nucleotides in clone 7e. The nucleotide 
25 sequence of the probe was 

5' AGC AGA CAA GGG GCC TCC TAG GGT GCA TAA T 3V 
The sequence of HCV cDNA in clone 7f, its overlap with clone 7e, and the amino acids encoded therein are 
shown in Fig. 22. 

Clone 11b was isolated using a probe based on the sequence of clone 7f. The nucleotide sequence of 
30 the probe was 

5' CAC CTA TGT TTA TAA CCA TCT CAC TCC TCT 3\ 
The sequence of HCV cDNA in clone 11b, its overlap with clone 7f, and the amino acids encoded therein 
are shown in Fig. 23. 

Clone 14i was isolated using a probe based on the sequence of nucleotides in clone 11b. The 
35 nucleotide sequence of the probe was 

5' CTC TGT CAC CAT ATT ACA AGC GCT ATA TCA 3\ 
The sequence of HCV cDNA in clone 14i, its overlap with 11b, and the amino acids encoded therein are 
shown in Fig. 24. 

Clone 39c was isolated using a probe based on the sequence of nucleotides in clone 33g. The . 
40 nucleotide sequence of the probe Was 

5' CTC GTT GCT ACG TCA CCA CAA TTT GGT GTA 3' 
The sequence of HCV cDNA in clone 39c, its overlap with clone 33g, and the amino acids encoded therein ; ; 
are shown in Fig. 25. 

45 IV.A.18. The Composite HCV cDNA Sequence Derived from Isolated Clones Containing HCV cDNA 

The HCV cDNA sequences in the isolated clones described supra have been aligned to create a 
composite HCV cDNA sequence. The isolated clones, aligned in the 5' to 3' direction are: 14i, 7f, 7e, 8h, 
33c, 40b, 37b, 35, 36, 81, 32, 33b, 25c, 14c, 8f, 33f, 33g, and 39c. 

50 A composite HCV cDNA sequence derived from the isolated clones, and the amino acids encoded 
therein, is shown in Fig. 26. 

In creating the composite sequence the following sequence heterogeneities have been considered. 
Clone 33c contains an HCV cDNA of 800 base pairs, which overlaps the cDNAs in clones 40b and 37c. In 
clone 33c, as well as in 5 other overlapping clones, nucleotide #789 is a G. However, in clone 37b (see 

55 Section IV.A.11), the corresponding nucleotide is an A. This sequence difference creates an apparent 
heterogeneity in the amino acids encoded therein, which would be either CYS or TYR, for G or A, 
respectively. This heterogeneity may have important ramifications in terms of protein folding. 


33 


EP 0 318 216 B1 


Nucleotide residue #2 in clone 8h HCV cDNA is a T. However, as shown infra, the corresponding 
residue in clone 7e is an A; moreover, an A in this position is also found in 3 other isolated overlapping 
clones. Thus, the T residue in clone 8h may represent a cloning artifact. Therefore, in Fig. 26, the residue in 
this position is designated as an A. 
5 The ^-terminal nucleotide in clone 8f HCV cDNA is a G. However, the corresponding residue in clone 
33f, and in 2 other overlapping clones is a T. Therefore, in Fig. 26, the residue in this position is designated 
as a T. 

The ^-terminal sequence in clone 33f HCV cDNA is TTGC. However, the corresponding sequence in 
clone 33g and in 2 other overlapping clones is ATTC. Therefore, in Fig. 26, the corresponding region is 
70 represented as ATTC. 

Nucleotide residue #4 in clone 33g HCV cDNA is a T. However, in clone 33f and in 2 other overlapping 
clones the corresponding residue is an A. Therefore, in Fig. 26, the corresponding residue is designated as 
an A. 

The 3'-terminus of clone 14i is an AA, whereas the corresponding dinucleotide in clone 11b, and in 
75 three other clones, is TA. Therefore, in Fig. 26, the TA residue is depicted. 
The resolution of other sequence heterogeneities is discussed supra. 

An examination of the composite HCV cDNA indicates that it contains one large ORF. This suggests 
that the viral genome is translated into a large polypeptide which is processed concomitant with, or 
subsequent to translation. 

20 

IV.A.19. Isolation and Nucleotide Sequences of HCV cDNAs in Clones 12f, 35f t 19g, 26g, and 15e 

The HCV cDNAs in clones 12f, 35f, 19g, 26g, and 15e were isolated essentially by the technique 
described in Section IV.A.17, except that the probes were as indicated below. The frequency of clones 
25 which hybridized with the probes was approximately 1 in 50,000 in each case. The nucleotide sequences of 
the HCV cDNAs in these clones were determined essentially as described in Section I V.A.2., except that the 
cDNA from the indicated clones were substituted for the cDNA isolated from clone 5-1-1 . 

The isolation of clone 12f, which contains cDNA upstream of the HCV cDNA in Fig. 26, was 
accomplished using a hybridization probe based on the sequence of nucleotides in clone 14i. The 
30 nucleotide sequence of the probe was 

5* TGC TTG TGG ATG ATG CTA CTC ATA TCC CAA 3\ 
The HCV cDNA sequence of clone 12f, its overlap with clone 14i, and the amino acids encoded therein are 
shown in Fig. 27. 

The isolation of clone 35f, which contains cDNA downstream of the HCV cDNA in Fig. 26, was 
35 accomplished using a hybridization probe based on the sequence of nucleotides in clone 39c. The 
nucleotide sequence of the probe was 

5' AGC AGC GGC GTC AAA AGT GAA GGC TAA CTT 3V 
The sequence of clone 35f, its overlap with the sequence in clone 39c, and the amino acids encoded 
therein are shown in Fig. 28. 
40 The isolation of clone 19g was accomplished using a hybridization probe based on the 3' sequence of 
clone 35f. The nucleotide sequence of the probe was 

5' TTC TCG TAT GAT ACC CGC TGC TTT GAC TGC 3\ 
The HCV cDNA sequence of clone 19g, its overlap with the sequence in clone 35f, and the amino acids 
encoded therein are shown in Fig. 29. 
45 The isolation of clone 26g was accomplished using a hybridization probe based on the 3' sequence of 
clone 19g. The nucleotide sequence of the probe was 

5' TGT GTG GCG ACG ACT TAG TCG TTA TCT GTG 3\ 
The HCV cDNA sequence of clone 26g, its overlap with the sequence in clone 19g, and the amino acids 
encoded therein are shown in Fig. 30. 
so Clone 15e was isolated using a hybridization probe based on the 3' sequence of clone 26 g. The 
nucleotide sequence of the probe was 

5' CAC ACT CCA GTC AAT TCC TGG CTA GGC AAC 3'. 
The HCV cDNA sequence of clone 1 5e, its overlap with the sequence in clone 26g, and the amino acids 
encoded therein are shown in Fig. 31. 
55 The clones described in this Section have been deposited with the ATCC under the terms and 
conditions described in Section II A., and have been assigned the following Accession Numbers. 
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lambda-gtn 

ATCC No. 

Deposit Date 

clone 12f 

40514 

10 November 1988 

clone 35f 

40511 

10 November 1988 

clone 15e 

40513 

10 November 1988 

clone k9-1 

40512 

10 November 1988 


The HCV cDNA sequences in the isolated clones described supra, have been aligned to create a 
composite HCV cDNA sequence. The isolated clones, aligned in the 5, to 3 1 direction are: 12f, 14i, 7f, 7e, 
8h, 33c, 40b, 37b, 35, 36, 81 , 32, 33b, 25c, 14c, 8f 33f, 33g, 39c, 35f, 19g, 26g, and 15e. 

A composite HCV cDNA sequence derived from the isolated clones, and the amino acids encoded 
therein, is shown in Fig. 32. 

IV.A.20; Alternative Method of Isolating cDNA Sequences Upstream of the HCV cDNA Sequence in Clone 

12f " .. ■ 

Based on the most 5' HCV sequence in Fig. 32, which is derived from the HCV cDNA in clone 12f, 
small synthetic oligonucleotide primers of reverse transcriptase are synthesized and used to bind to the 
corresponding sequence in HCV genomic RNA, to prime reverse transcription of the upstream sequences. 
The primer sequences are proximal to the known 5'-terminal sequence of clone 12f, but sufficiently 
downstream to allow the design of probe sequences upstream of the primer sequences. Known standard 
methods of priming and cloning are used. The resulting cDNA libraries are screened with sequences 
upstream of the priming sites (as deduced from the elucidated sequence in clone 12f). the HCV genomic 
RNA is obtained from either plasma or liver samples from chimpanzees with NANBH, or from analogous 
samples from humans with NANBH. 

IV.A.21 . Alternative Method Utilizing Tailing to Isolate Sequences from the S'-Terminal Region of the HCV 
Genome 

In order to isolate the extreme 5Merminal sequences of the HCV RNA genome, the cDNA product of 
the first round of reverse transcription, which is duplexed with the template RNA, is tailed with oligo C. This 
is accomplished by incubating the product with terminal transferase in the presence of CTP. The second 
round of cDNA synthesis, which yields the complement of the first strand of cDNA, is accomplished utilizing 
oligo G as a primer for the reverse transcriptase reaction. The sources of genomic HCV RNA are as. 
described in Section IV.A.20. The methods for tailing with terminal transferase, and for the reverse 
transcriptase reactions are as in Maniatis et al. (1982). The cDNA products are then cloned, screened/and 
sequenced. 

IV.A.22. Alternative Method Utilizing Tailing to Isolate Sequences from the 3'-Terminal Region ; of the HCV 
Genome 

This method is based on previously used methods for cloning cDNAs of Flavivirus RNA. In this method, 
the RNA is subjected to denaturing conditions to remove secondary structures at the 3'-terminus, and is 
then tailed with Poly A polymerase using rATP as a substrate. Reverse transcription of the poly A tailed 
RNA is catalyzed by reverse transcriptase, utilizing oligo dT as a primer. The second strands of cDNA are 
synthesized, the cDNA products are cloned, screened, and sequenced. 

IV. A.23 Creati on of Lambda-gt11 HCV cDNA Libraries Containing Larger cDNA Inserts 

The method used to create and screen the Lambda gtll libraries are essentially as described in 
Section IV.A.1., except that the library is generated from a pool of larger size cDNAs eluted from the 
Sepharose CL-4B column. 

IV.A.24. Creation of HCV cDNA Libraries Using Synthetic Oligomers as Primers 

New HCV cDNA libraries have been prepared from the RNA derived from the infectious chimpanzee 
plasma pool described in Section IV.A.1., and from the poly A + RNA fraction derived from the liver of this 
infected animal. The cDNA was constructed essentially as described by Gubler and Hoffman (1983), except 
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that the primers for the first c DNA strand synthesis were two synthetic oligomers based on the sequence of 
the HCV genome described supra. Primers based on the sequence of clone 11 b and 7e were, respectively, 

6 5' CTG GCT TGA AGA ATC 3 ' 

. and 

70 

5' AGT TAG GCT GGT GAT TAT GC 3'v 

. The resulting cDNAs were cloned into lambda bacteriophage vectors, and screened with various other 
75 synthetic oligomers, whose sequences were based on the HCV sequence in Fig. 32. 

IV.B. Expressi on of Polypeptides Encoded Within HCV cDNAs and Identification of the Expressed Products 
as HCV Induced Antigens . 

20 IV.B.1 . Expression of the Polypeptide Encoded in Clone 5-1-1 . 

The HCV polypeptide encoded within clone: 5-1-1 (see Section IV.A.2., supra) was expressed as a 
fusion polypeptide with superoxide dismutase (SOD). This was accomplished by subcloning the clone 5-1-1 
cDNA insert into the expression vector pSODcfl (Steimer et al. (1986)) as follows. 
25 First, DNA isolated from pSODcfl was treated with BamHI and EcoRI, and the following linker was 
ligated into the linear DNA created by the restriction enzymes: 

5' GAT CCT GGA ATT CTG ATA A 3' 

30 ' 

3' GA CCT TAA GAC TAT TTT AA 5 ' 

After cloning, the plasmid containing the insert was isolated. 
35 Plasmid containing the insert was restricted with EcoRI. The HCV cDNA insert in clone 5-1-1 was 
excised with EcoRI, and ligated into this EcoRI linearized plasmid DNA. The DNA mixture was used to 
transform E. coli strain D1210 (Sadler et al. (1980)). Recombinants with the 5-1-1 cDNA in the correct 
orientation Tofexpression of the ORF shown in Fig. 1 were identified by restriction mapping and nucleotide 
sequencing. 

40 Recombinant bacteria from one clone were induced to express the SOD-NANB5-1-1 polypeptide by 
growing the bacteria in the presence of IPTG. 

IV.B.2, Expression of the Polypeptide Encoded in Clone 81. 

45 the HCV cDNA contained within clone 81 was expressed as a: SOD-NANB 8 i fusion polypeptide. The 
method for preparing the vector encoding this fusion polypeptide was analogous to that used for the 
creation of the vector encoding SOD-NANB5-1 -1 , except that the source of the HCV cDNA was clone 81 , 
which was isolated as described in Section IV A3, and for which the cDNA sequence was determined as 
described in Section IV.A.4. The nucleotide sequence of the HCV cDNA in clone 81 , and the putative amino 

50 acid sequence of the polypeptide encoded therein are shown in. Fig. 4. 

The HCV cDNA insert in clone 81 was excised with EcoRI, and ligated into the pSODcfl which 
contained the linker (see IV.B.1.) and which was linearized by treatment with EcoRI. The DNA mixture was 
used to transform E. coli strain D1210. Recombinants with the clone 81 HCV cDNA in the correct orientation 
for expression of~*the~ORF shown in Fig. 4 were identified by restriction mapping and nucleotide 

55 sequencing. 

Recombinant bacteria from one clone were induced to express the SOD-NANBisi polypeptide by 
growing the bacteria in! the presence of IPTG. 
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IV.B.3. Identification of the Polypeptide Encoded Within Clone 5-1-1 as an HCV and NANBH Associated 
Antigen . 

The polypeptide encoded within the HCV cDNA of clone 5-1-1 was identified as a NANBH associated 

5 antigen by demonstrating that sera of chimpanzees and humans infected with NANBH reacted im- 
munologically with the fusion polypeptide, SOD-NANB5-1-1, which is comprised of superoxide dismutase at 
its N-terminus and the in-frame 5-1-1 antigen at its C-terminus. This was accomplished by "Western" 
blotting (Towbin et al. (1979)) as follows. 

A recombinant strain of bacteria transformed with an expression vector encoding the SOD-NANB5-1-1 

70 polypeptide, described in Section IV.B.I., was induced to express the fusion polypeptide by growth in the 
presence of IPTG. Total bacterial lysate was subjected to electrophoresis through polyacrylamide gels in 
the presence of SDS according to Laemmli (1970). The separated polypeptides were transferred onto 
nitrocellulose filters (Towbin et al. (1979)). The filters were then cut into thin strips, and the strips were 
incubated individually with the different chimpanzee and human sera. Bound antibodies were detected by 

75 further incubation with 125 l-labeled sheep anti-human Ig, as described in Section IY.A.1. 

The characterization of the chimpanzee sera used for the Western blots and the results, shown in the 
photograph of the autoradiographed strips, are presented in Fig. 33. Nitrocellulose strips containing 
polypeptides were incubated with sera derived from chimpanzees at different times during acute NANBH 
(Hutchinson strain) infections (lanes 1-16), hepatitis A infections (lanes 17-24, and 26-33), and hepatitis B 

20 infections (lanes 34-44). Lanes 25 and 45 show positive controls in which the immunoblots were incubated 
with serum from the patient used to identify the recombinant clone 5-1-1 in the original screening of the 
Iambda-gt1 1 cDNA library (see Section IV.A.1 .). 

The band visible in the control lanes, 25 and 45, in Fig. 23 reflects the binding of antibodies to the 
NANB5-1-1 moiety of the SOD fusion polypeptide. These antibodies do not exhibit binding to SOD alone, 

25 since this has also been included as a negative control in these samples, and would have appeared as a 
band migrating significantly faster than the SOD-NANB s -i -1 fusion polypeptide. 

Lanes 1-16 of Fig. 33 show the binding of antibodies in sera samples of 4 chimpanzees; the sera were 
obtained just prior to infection with NANBH, and sequentially during acute infection. As seen from the figure, 
whereas antibodies which reacted immunologically with the SOD-NANB5-1-1 polypeptide were absent in 

30 sera samples obtained before administration of infectious HCV inoculum and during the early acute phase 
of infection, all 4 animals eventually induced circulating antibodies to this polypeptide during the late part of, 
or following the acute phase. Additional bands observed on the immunoblots in the cases of chimps 
numbers 3 and 4 were due to background binding to host bacterial proteins. 

In contrast to the results obtained with sera from chimps infected with NANBH, the development of 

35 antibodies to the NANB5-1-1 moiety of the fusion polypeptide was not observed in 4 chimpanzees infected 
with HAV or 3 chimpanzees infected with HBV. The only binding in these cases was background binding to 
the host bacterial proteins, which also occurred in the HCV infected samples. 

The characterization of the human sera used for the Western blots, and the results, which are shown in 
the photograph of the autoradiographed strips, are presented in Fig. 34. Nitrocellulose strips containing 

40 polypeptides were incubated with sera derived from humans at different times during infections with NANBH 
(lanes 1-21), HAV (lanes 33-40), and HBV (lanes 41-49). Lanes 25 and 50 show positive controls in which 
the immunoblots were incubated with serum from patient used in the original screening of the Iambda-gt1 1 
library, described supra. Lanes 22-24 and 26-32 show "non-infected" controls in which the sera was from 
"normal" blood donors. 

45 As seen in Fig. 34, sera from nine NANBH patients, including the serum used for screening the tambda- 
gt11 library, contained antibodies to . the NANB5- 1-1 moiety of the fusion polypeptide. Sera from three 
patients with NANBH did not contain these antibodies. It is possible that the anti-NANBs-i -i antibodies will 
develop at a future date in these patients. It is also possible that this lack of reaction resulted from a 
different NANBV agent being causative of the disease in the individuals from which the non-responding 

50 serum was taken. 

Fig. 34 also shows that sera from many patients infected with HAV and HBV did not contain anti- 
NANB5-1-1 antibodies, and that these antibodies were also not present in the sera from "normal" controls. 
Although one HAV patient (lane 36) appears to contain anti-NANBs-i-i antibodies, it is possible that this 
patient had been previously infected with HCV, since the incidence of NANBH is very high and since it is 
55 often subclinical. 

These serological studies indicate that the cDNA in clone 5-1-1 encodes epitopes which are recognized 
specifically by sera from patients and animals infected with BB-NANBV. In addition, the cDNA does not 
appear to be derived from the primate genome. A hybridization probe made from clone 5-1-1 or from clone 
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81 did not hybridize to "Southern" blots of control human and chimpanzee genomic DNA from uninfected 
individuals under conditions where unique, single-copy genes are detectable. These probes also did not 
hybridize to Southern blots of control bovine genomic DNA. 

5 IV.B.4. Expression of the Polypeptide Encoded in a Composite of the HCV cDNAs in Clones 36, 81 and 32 

The HCV polypeptide which is encoded in the ORF which extends through clones 36, 81 and 32 was 
expressed as a fusion polypeptide with SOD. This was accomplished by inserting the composite cDNA, 
C100, into an expression cassette which contains the human superoxide dismutaise gene, inserting the 

70 expression cassette into a yeast expression vector, and expressing the polypeptide in yeast. 

An expression cassette containing the composite C100 cDNA derived from clones 36, 81, and 32, was 
constructed by inserting the ~1270bp EcoRI fragment into the EcoRI site of the vector pS3-56 (also called 
pS356), yielding the plasmid pS3-56 C ibo. The construction of C1 00 is described in Section IV.A.1 6, supra. 
The vector pS3-56, which is a pBR322 derivative* contains an expression cassette which is comprised 

75 of the ADH2/GAPDH hybrid yeast promoter upstream of the human superoxide dismutase gene, and a 
downstream GAPDH transcription terminator. A similar cassette, which contains these control elements and 
the superoxide dismutase gene has been described in Cousens et al. (1987), and in copending application 
EPO 196,056, published October 1, 1986, which is commonly owned by the herein assignee. The cassette 
in pS3-56, however, differs from that in Cousens et al. (1987) in that the heterologous proinsulin gene and 

20 the immunoglobulin hinge are deleted, and in that the glnis4 of the superoxide dismutase is followed by an 
adaptor sequence which contains an EcoRI site. The sequence of the adaptor is: 

5 ' - AAT TTG GGA ATT CCA TAA TGA G -3 ' 

25 AC CCT TAA GGT ATT ACT CAG CT 

The EcoRI site allows the insertion of heterologous sequences which, when expressed from a vector 
containing the cassette, yield polypeptides which are fused to superoxide dismutase via an oligopeptide 
30 linker containing the amino acid sequence: 
-asn-leu-gly-ile-arg-. 

A sample of pS356 has been deposited on 29 April 1988 under the terms of the Budapest Treaty with 
the American Type Culture Collection (ATCC), 12301 Parklawn Dr., Rockville, Maryland 20853, and has 
been assigned Accession No. 67683. The terms and conditions for availability ^nd access to the deposit,; 

35 and for maintenance of the deposit are the same as those specified in Section (I.A., for strains containing 
NANBV-cDNAs. This deposit is intended for convenience only, and is not required to practice the present 
invention in view of the description here. The deposited material is hereby incorporated herein by reference. 

After recombinants containing the C100 cDNA insert in the correct orientation were isolated, the 
expression cassette containing the C100 cDNA was excised from pS3-56 C ioo with BamHI, and a fragment of 

40 ~3400bp which contains the cassette was isolated and purified. This fragment was then inserted into the : : 
BamHI site of the yeast vector pAB24. 

Plasmid pAB24, the significant features of which are shown in Fig. 35, is a yeast shuttle vector which = 
contains the complete 2 micron sequence for replication [Broach (1981)] and pBR322 sequences. It also 
contains the yeast URA3 gene derived from plasmid YEp24 [Botstein et al. (1979)], and the yeast LEU 2d 

45 gene derived from plasmid pC1/1. EPO Pub. No. 116,201. Plasmid pAB24 was constructed by digesting 
YEp24 with EcoRI and religating the vector to remove the partial 2 micron sequences. The resulting 
plasmid, YEP24deltaRI, was linearized by digestion with Clal and ligated with the complete 2 micron 
plasmid which had been linearized with Clal. The resulting plasrnid, pCBou, was then digested with Xbal 
and the 8605 bp vector fragment was gel isolated. This isolated Xbal fragment was ligated with a 4460 bp 

so Xbal fragment containing the LEU 2d gene isolated from pC1/1; the orientation of the LEU 2d gene is in the 
same direction as the URA3 gene. Insertion of the expression was in the unique BamHI site of the pBR322 
sequence, thus interrupting the gene for bacterial resistance to tetracycline. 

The recombinant plasmid which contained the SOD-C100 expression cassette, pAB24C1 (XK3, was 
transformed into yeast strain JSC 308, as well as into other yeast strains. The cells were transformed as 

55 described by Hinnen et al. (1978), and plated onto ura-selective plates. Single colonies were inoculated into 
leu-selective media and grown to saturation. The culture was induced to express the SOD-C100 polypeptide 
(called C100-3) by growth in YEP containing 1 % giucose. 
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Strain JSC 308 is of the genotype MAT @, Ieu2, ura3(del) DM15 (GAP/ADR1) integrated at the ADR1 
locus. In JSC 308, over-expression of the positive activator gene product, ADR1, results in hyperderepres- 
sion (relative to an ADR1 wild type control) and significantly higher yields of expressed heterologous 
proteins when such proteins are synthesized via an ADH2 UAS regulatory system. A sample of JSC 308 
5 has been deposited on 5 May 1988 with the ATCC under the conditions of the Budapest Treaty, and has 
been assigned Accession No. 20879. The terms and conditions for availability and access to the deposit, 
and for maintenance of the deposit are the same as those specified in Section M.A., for strains containing 
HCV cDNAs. 

The complete C100-3 fusion polypeptide encoded in pAB24C 100-3 should contain 154 amino acids of 
io human SOD at the amino-terminus, 5 amino acid residues derived from the synthetic adaptor containing the 
EcoRI site, 363 amino acid residues derived from C100 cDNA, and 5 carboxy-terminal amino acids derived 
from the MS2 nucleotide, sequence adjoining the HCV CDNA sequence in clone 32. (See Section IV.A.7,) 
The putative amino acid sequence of the carboxy-terminus of this polypeptide, beginning at the penultimate 
Ala residue of SOD, is shown in Fig. 36; also shown is the nucleotide sequence encoding this portion of the 
75 polypeptide. 

IV.B.5. Identification of the Polypeptide Encoded within C100 as an NANBH Associated Antigen 

The C 100-3 fusion polypeptide expressed from plasmid pAB24C1 00-3 in yeast strain JSC 308 was : 

20 characterized with respect to size, and the polypeptide encoded within C100 was identified as an NANBH- 
associated antigen by its immunological reactivity with serum from a human with chronic NANBH. 

The C100-3 polypeptide, which was expressed as described in Section IV.B.4., was analyzed as follows. 
Yeast JSC 308 cells were transformed with pAB24, or with pAB24C1 00-3, and were induced to express the 
heterologous plasmid encoded polypeptide. The induced yeast cells in 1 ml of culture (OD 650 nm ~20) were 

25 pelleted by centrifugation at 10,000 rpm for 1 minute, and were lysed by vortexing them vigorously (10 x 1 
min) with 2 volumes of solution and 1 volume of glass beads (0.2 millimicron diameter). The solution 
contained 50 mM Tris-HCI, pH 8.0, 1 mM EDTA, 1mM phenylmethylsulphonyl fluoride (PMSF), and 1 
microgram/ml pepstatin. Insoluble material in the lysate, which includes the C100-3 polypeptide, was 
collected by centrifugation (10,000 rpm for 5 minutes), and was dissolved by boiling for 5 minutes in 

30 Laemmli SDS sample buffer. [See Laemmli (1970)]. An amount of polypeptides equivalent to that in 0.3 ml 
of the induced yeast culture was subjected to electrophoresis through 10% polyacrylamide gels in the 
presence of SDS according to Laemmli (1970). Protein standards were co-electrophoresed on the gels. Gels 
containing the expressed polypeptides were either stained with Coomassie brilliant blue, or were subjected 
to "Western" blotting as described in Section IV.B.2., using serum from a patient with chronic NANBH to 

35 determine the immunological reactivity of the polypeptides expressed from pAB24 and from pAB24C1 00-3. 
The results are shown in Fig. 37. In Fig. 37A the polypeptides were stained with Coomassie brilliant 
blue. The insoluble polypeptide(s) from JSC 308 transformed with pAB24 and from two different colonies of 
JSC transformed with pAB24C100-3 are shown in lane 1 (pAB24), and lanes 2 and 3, respectively. A 
comparison of lanes 2 and 3 with lane 1 shows the induced expression of a polypeptide corresponding to a 

40 molecular weight of "54,000 daltons from JSC 308 transformed with pAB24C100-3, which is not induced in 
JSC 308 transformed with pAB24. This polypeptide is indicated by the arrow. 

Fig. 37B shows the results of the Western blots of the insoluble polypeptides expressed in JSC 308 
transformed with pAB24 (lane 1), or with pAB24C1 00-3 (lane 2). Trie polypeptides expressed from pAB24 
were not immunologically reactive with serum from a human with NANBH. However, as indicated by the 

45 arrow, JSC 308 transformed with pAB24C100-3 expressed a polypeptide of "54,000 dalton molecular weight : 
which did react immunologically with the human NANBH serum. The other immunologically .reactive 
polypeptides in lane 2 may be degradation and/or aggregation products of this "54,000 dalton polypeptide. 

IV.B.6. Purification of Fusion Polypeptide C100-3 

50 

The fusion polypeptide, C1 00-3, comprised of SOD at the N-terminus and in-frame C100 HCV- 
polypeptide at the C-terminus was purified by differential extraction of the insoluble fraction of the extracted 
host yeast cells in which the polypeptide was expressed. 

The fusion polypeptide, C100-3, was expressed in yeast strain JSC 308 transformed with pAB24C100-3, 
55 as described in Section IV.B.4, The yeast cells were then lysed by homogenization, the insoluble material in 
the lysate was extracted at pH 12.0, and C100-3 in the remaining insoluble fraction was solubilized in buffer 
containing SDS. 
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The yeast lysate was prepared essentially according to Nagahuma et al. (1984). A yeast cell suspension 
was prepared which was 33% cells (v/v) suspended in a solution (Buffer A) containing 20 mM Tris HCI, pH 
8.0, 1 mM dithiothreitol, and 1 mM phenylmethylsulfonylfluoride (PMSF). An aliquot of the suspension (15 
ml) was mixed with an equal volume of glass beads (0.45-0.50 mm diameter), and the mixture was vortexed 

5 at top speed on a Super Mixer (Lab Line Instruments, Inc.) for 8 min. The homogenate and glass beads 
were separated, and the glass beads were washed 3 times with the same volume of Buffer A as the original 
packed cells. After combining the washes and homogenate, the insoluble material in the lysate was obtained 
by centrifuging the homogenate at 7,000 x g for 15 minutes at 4*C, resuspending the pellets in Buffer A 
equal to twice the volume of original packed cells, and re-pelleting the material by centrifugation at 7,000 x 

70 g for 15 min. This washing procedure was repeated 3 times. 

The insoluble material from the lysate was extracted at pH 12.0 as follows. The pellet was suspended in 
buffer containing 0.5 M NaCI, 1 mM EDTA, where the suspending volume was equal to 1 .8 times the of the 
original packed cells. The pH of the suspension was adjusted by adding 0.2 volumes of 0.4 M Na 
phosphate buffer, pH 12.0. After mixing, the suspension was centrifuged at 7,000 x g for 15 min at 4*C, and 

15 the supernatant removed. The extraction was repeated 2 times. The extracted pellets were washed by 
suspending them in 0.5 M NaCI, 1 mM EDTA, using a suspension volume equal to two volumes of the 
original packed cells, followed by centrifugation at 7,000 x g for 15 min at 4'C. 

The C100-3 polypeptide in the extracted pellet was solubilized by treatment with SDS. The pellets were 
suspended in Buffer A equal to 0.9 volumes of the original packed cell volume, and 0.1 volumes of 2% SDS 

20 was added. After the suspension was mixed, it was centrifuged at 7,000 x g for 15 min at 4*C. The 
resulting pellet was extracted 3 more times with SDS. The resulting supernatants, which contained C100-3 
were pooled. 

This procedure purifies C100-3 more than 10-fold from the insoluble fraction of the yeast homogenate, 
and the recovery of the polypeptide is greater than 50%., 
25 The purified preparation of fusion polypeptide was analyzed by polyacrylamide gel electrophoresis 
according to Laemmli (1970). Based upon this analysis, the polypeptide was greater than 80% pure, and 
had an apparent molecular weight of "54,000 daltons. 

IV.C. Identification of RNA in Infected Individuals Which Hybridizes to HCV cDNA . 
30 ~ ~~ 

IV.C.1 . Identification of RNA in the Liver of a Chimpanzee With NANBH Which Hybridizes to HCV cDNA . 

RNA from the liver of a chimpanzee which had NANBH was shown to contain a species of RNA which 
hybridized to the HCV cDNA contained within clone 81 by Northern blotting, as follows. 

35 RNA was isolated from a liver biopsy of the chimpanzee from which the high titer plasma was derived 
(see Section IV.A.1.) using techniques described in Maniatis et al. (1982) for the isolation of total RNA from 
mammalian cells, and for its separation into poly A + and poly A~ fractions. These RNA fractions were 
subjected to electrophoresis on a formaldehyde/agarose gel (1% w/v), and transferred to nitrocellulose. 
(Maniatis et al. (1982)). The nitrocellulose filters were hybridized with radiolabeled HCV CDNA from clone 

40 81 (see Fig. 4 for the nucleotide sequence of the insert.) To prepare the radiolabeled probe, the HCV cDNA 
insert isolated from clone 81 was radiolabeled with 32 P by nick translation using DNA Polymerase I (Maniatis 
et al. (1982)). Hybridization was for 18 hours at 42* C in a solution Containing 10% (w/v) Dextran sulphate, 
50% (w/v) deionized formamide, 750 mM NaCI, 75 mM Na citrate, 20 mM NaaHPO*. pH 6.5, 0.1% SDS, 
0.02% (w/v) bovine serum albumin (BSA), 0.02% (w/v) Ficoll-400, 0.02% (w/v) polyvinylpyrrolidone, 100 

45 micrograms/ml salmon sperm DNA which had been sheared by sonication and denatured, and 10 6 CPM/ml 
of the nick-translated CDNA probe. 

An autoradiograph of the probed filter is shown in Fig. 38. Lane 1 contains 32 P-labeled restriction 
fragment markers. Lanes 2-4 contain chimpanzee liver RNA as follows: lane 2 contains 30 micrograms of 
total RNA; lane 3 contains 30 micrograms of poly A- RNA; and lane 4 contains 20 micrograms of poly A + 

50 RNA. As shown in Fig. 38, the liver of the chimpanzee with NANBH contains a heterogeneous population of 
related poly A + RNA molecules which hybridizes to the HCV CDNA probe, and which appears to be from 
about 5000 nucleotides to about 11,000 nucleotides in size. This RNA, which hybridizes to the HCV cDNA, 
could represent viral genomes and/or specific transcripts of the viral genome. 

The experiment described in Section IV.C.2., infra, is consistent with the suggestion that HCV contains 

55 an RNA genome. 
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IV.C.2. Identification of HCV Derived RNA in Serum from Infected Individuals . 

Nucleic acids were extracted from particles isolated from high titer chimpanzee NANBH plasma as 
described in Section IV.A.1.. Aliquots (equivalent to 1 ml of original plasma) of the isolated nucleic acids 

5 were resuspended in 20 microliters 50 mM Hepes, pH 7.5, 1 mm EDTA and 16 micrograms/ml yeast 
soluble RNA. The samples were denatured by boiling for 5 minutes followed by immediate freezing, and 
were treated with RNase A (5 microliters containing 0.1 mg/ml RNase A in 25 mM EDTA, 40 mM Hepes, pH 
7.5) or with DNase I (5 microliters containing 1 unit DNase: I in 10 mM MgCI 2 , 25 mM Hepes, pH 7.5); 
control samples were incubated without enzyme. Following incubation, 230 microliters of ice-cold 2XSSC 

10 containing 2 micrograms/ml yeast soluble RNA was added, and the samples were filtered on a nitrocellulose 
filter. The filters were hybridized with a cDNA probe from clone 81, which had been 32 P-labeled by nick-? 
translation. Fig. 39 shows an autoradiograph of the filter. Hybridization signals were detected in the DNase 
treated and control samples (lanes 2 and 1, respectively), but were not detected in the RNase treated 
sample (lane 3). Thus, since RNase A treatment destroyed the nucleic acids isolated from the particles, and 

75 DNase I treatment had no effect, the evidence strongly suggests that the HCV genome is composed of 
RNA. 

IV.C.3. Detection of Amplified HCV Nucleic Acid Sequences derived from HCV Nucleic Acid Sequences in 
Liver and Plasma Specimens from Chimpanzees with NANBH 

20 '' ' ' _." : ' • ~ ' ~ ~~ ' ; 

HCV nucleic acids present in liver and plasma of chimpanzees with NANBH, and in control chim- 
panzees, were amplified using essentially the polymerase chain reaction (PCR) technique described by 
Saiki et al. (1986). The primer oligonucleotides were derived from the HCV cDNA sequences in clone 81, or 
clones 36 and 37. The amplified sequences were detected by gel electrophoresis and Southern blotting, 

25 using as probes the appropriate cDNA oligomer with a sequence from the region between, but not 
including, the two primers. 

Samples of RNA Containing HCV sequences to be examined by the amplification system were isolated 
from liver biopsies of three chimpanzees with NANBH, and from two control chimpanzees. The isolation of 
the RNA fraction was by the guanidinium thiocyanate procedure described in Section IV.C.1 . 

30 Samples of RNA which were to be examined by the amplification system were also isolated from the 
plasmas of two chimpanzees with NANBH, and from one control chimpanzee, as well as from a pool of 
plasmas from control chimpanzees. One infected chimpanzee had a CID/ml equal to or greater than 10 s , 
and the other infected chimpanzee had a CID/ml equal to or greater than 10 s . 

The nucleic acids were extracted from the plasma as follows. Either 0.1 ml or 0,01 ml of plasma was 

35 diluted to a final volume of 1.0 ml, with a TENB/proteinase K/SDS solution (0.05 M Tris-HCL, pH 8.0, 0.001 
M EDTA, 0.1 M NaCI, 1 mg/ml Proteinase K, and 0.5% SDS) containing 10 micrograms/ml polyadenylic 
acid, and incubated at 37 °C for 60 minutes. After this proteinase K digestion, the resultant plasma fractions 
were deproteinized by extraction with TE (10.0 mM Tris-HCI, pH 8.0, 1 mM EDTA) saturated phenol. The 
phenol phase was separated by centrifugation, and was reextracted with TENB containing 0.1% SDS. The; 

40 resulting aqueous phases from each extraction were pooled, and extracted twice with an equal volume of 
phenol/chloroform/isoamyl alcohol [1:1(99:2)], and then twice with an equal volume of a 99:1 mixture of 
chloroform/isoamyl alcohol. Following phase separation by centrifugation, the aqueous phase was brought to ; 
a final concentration of 0.2 M Na Acetate, and the nucleic acids were precipitated by the addition of two 
volumes of ethanol. The precipitated nucleic acids were recovered by ultracentrifugation in a SW 41 rotor at ; 

45 38 K, for 60 minutes at 4 • C. 

In addition to the above, the high titer chimpanzee plasma and the pooled control plasma alternatively 
were extracted with 50 micrograms of poly A carrier by the procedure of Chomcyzski and Sacchi (1987). 
This procedure uses an acid guanidinium thiocyanate extraction. RNA was recovered by centrifugation at 
10,000 RPM for 10 minutes at 4° C in an Eppendorf microfuge. 

so On two occasions, prior to the synthesis of cDNA in the PCR reaction, the nucleic acids extracted from 
plasma by the proteinase K/SDS/phenol method were further purified by binding to and elution from S and 
S Elutip-R Columns. The procedure followed was according to the manufacturer's directions. 

The cDNA used as a template for the PCR reaction was derived from the nucleic acids (either total 
nucleic acids or RNA) prepared as described above. Following ethanol precipitation, the precipitated nucleic 

55 acids were dried, and resuspended in DEPC treated distilled water. Secondary structures in the nucleic 
acids were disrupted by heating at 65 °C for 10 minutes, and the samples were immediately cooled on ice. _ 
cDNA was synthesized using 1 to 3 micrograms of total chimpanzee RNA from liver, or from nucleic acids 
(or RNA) extracted from 10 to 100 microliters of plasma. The synthesis utilized reverse transcriptase, and 
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was in a 25. microliter reaction, using the protocol specified by the manufacturer, BRL The primers for 
cDNA synthesis were those also utilized in the PCR reaction, described below. All reaction mixtures for 
cDNA synthesis contained 23 units of the RNAase inhibitor, RNASIN (Fisher/Prpmega). Following cDNA 
synthesis, the reaction mixtures were diluted with water, boiled for 10 minutes, and quickly chilled on ice. 
5 The PCR reactions were performed essentially according to the manufacturer's directions (Cetus- 
Perkin-Elmer), except for the addition of 1 microgram of RNase A, The reactions were carried out in a final 
volume of 100 microliters. The PCR was performed for 35 cycles, utilizing a regimen of 37*C, 72 # C, and 
94'C. 

The primers for cDNA synthesis and for the PCR reactions were derived from the HCV cDNA 
w sequences in either clone 81, clone 36, or clone 37b. (The HCV cDNA sequences of clones 81, 36, and 37b 
are shown in Figs. 4, 5, and 10, respectively.) The sequences of the two 16-mer primers derived from clone 
81 were: 

is 5' CAA TCA TAC CTG AC A G 3' 

and 

5 ' GAT AAC CTC TGC CTG A 3 ' . 

20 

The sequence of the primer from clone 36 was: 

5' GCA TGT CAT GAT GTA T 3\ 
The sequence of the primer from clone 37b was: 
5' ACA ATA CGT GTG TCA C 3\ 

25 In the PCR reactions, the primer pairs consisted of either the two 1 6-mers derived from clone 81, or the 16- 
mer from done 36 and the 16-mer from clone 37b. 

The PCR reaction products were analyzed by separation of the products by alkaline gel electrophoresis, 
followed by Southern blotting, and detection of the amplified HCV-cDNA sequences with a 32 P-labeled 
internal oligonucleotide probe derived from a region of the HCV cDNA which does not overlap the primers. 

30 The PCR reaction mixtures were extracted with phenol/chloroform, and the nucleic acids precipitated from 
the aqueous phase with salt and ethanol. The precipitated nucleic acids were collected by centrifugation, 
and dissolved in distilled water. Aliquots of the samples were subjected to electrophoresis on 1 .8% alkaline; 
agarose gels. Single stranded DNA of 60, 108, and 161 nucleotide lengths were co-electrophoresed on the 
gels as molecular weight markers. After electrophoresis, the DN As in the gel were, transferred onto Biorad 

35 Zeta Probe T " paper. Prehybridization and hybridization, and wash conditions were those specified by the 
manufacturer (Biorad). 

The probes used for the hybridization-detection of amplified HCV cDNA sequences were the following. 
When the pair of PCR primers were derived from clone 81, the probe was an 108-mer with a sequence 
corresponding to that which is located in the region between the sequences of the two primers. When the 

40 pair of PCR primers were derived from clones 36 and 37b, the probe was the nick-translated HCV cDNA 
insert derived from clone 35. The primers are derived from nucleotides 155-170 of the. clone. 37b insert, and 
206-268 of the clone 36 insert. The 3*-end of the HCV cDNA insert in clone 35 overlaps nucleotides 1-186 : : 
of the insert in clone 36; and the 5'-end of clone 35 insert overlaps nucleotides 207-269 of the insert in . 
clone 37b. (Compare Figs. 5, 8 and 10.) Thus, the cDNA insert in clone 35 spans part of the region between 

45 the sequences of the clone 36 and 37b derived primers, and is useful as a probe for the amplified 
sequences which include these primers. 

Analysis of the RNA from the liver specimens was according to the above procedure utilizing both sets 
of primers and probes. The RNA from the liver of the three chimpanzees with NANBH yielded positive 
hybridization results for amplification sequences of the expected size (161 and 586 nucleotides for 81 and 

so 36 and 37b, respectively), while the control chimpanzees yielded negative hybridization results. The same 
results were achieved when the experiment was repeated three times. 

Analysis of the nucleic acids and RNA from plasma was also according to the above procedure utilizing 
the primers and probe from clone 81 . The plasmas were from two chimpanzees with NANBH, from a control 
chimpanzee, and pooled plasmas from control chimpanzees. Both of the NANBH plasmas contained nucleic 

55 acids/RNA which yielded positive results in the PCR amplified assay, while both of the control plasmas 
yielded negative results. These results have been repeatably obtained several times. 
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IV.D. Radioimmunoassay for Detecting HCV Antibodies in Serum from Infected Individuals 

Solid phase radioimmunoassays to detect antibodies to HCV antigens were developed based upon Tsu 
and Herzenberg (1980). Microtiter plates (Immulon 2, Removawell strips) are coated with purified polypep- 

5 tides containing HCV epitopes. The coated plates are incubated with either human serum samples 
suspected of containing antibodies to the HCV epitopes, or to appropriate controls. During incubation, 
antibody, if present, is immunologically bound to the solid phase antigen. After removal of the unbound 
material and washing of the microtiter plates, complexes of human antibody-NANBV antigen are detected 
by incubation with 125 l-labeled sheep anti-human immunoglobulin. Unbound labeled antibody is removed by 

jo aspiration, and the plates are washed. The radioactivity in individual wells is determined; the amount of 
bound human anti-HCV antibody is proportional to the radioactivity in the well. 

IV.D.1. Purification of Fusion Polypeptide SOD-NANB s-i-i- 

75 The fusion polypeptide SOD-NANB5-1-1, expressed in recombinant bacteria as described in Section 
IV.B.1., was purified from the recombinant E. coli by differential extraction of the cell extracts with urea, 
followed by chromatography on anion and cation exchange columns as follows. 

Thawed cells from 1 liter of culture were resuspended in 10 ml of 20% (w/v) sucrose containing 0.01 M 
Tris HCI, pH 8.0, and 0.4 ml of 0.5M EDTA, pH 8.0 was added. After 5 minutes at 0*C, the mixture was 

20 centrifuged at 4,000 x g for 10 minutes. The resulting pellet was suspended in 10 ml of 25% (w/v) sucrose 
containing 0.05 M Tris HCI, pH 8.0, 1 mM phenylmethylsulfonylfluoride (PMSF) and 1 microgram/ml 
pepstatin A, followed by addition of 0.5 ml lysozyme (10 mg/ml) and incubation at 0 • C for 10 minutes. After 
the addition of 10 ml 1% (v/v) Triton X-100 in 0.05 M Tris HCI. pH 8.0, 1 mM EDTA, the mixture was 
incubated an additional 10 min at 0*C with occasional shaking. The resulting viscous solution was 

25 homogenized by passage 6 times through a sterile 20-gauge hypodermic needle, and centrifuged at 13,000 
x g for 25 minutes. The pelleted material was suspended in 5 ml of 0.01 M Tris HCI pH 8.0, and the 
suspension centrifuged at 4,000 x g for 10 minutes. The pellet, which contained SOD-NANB5-1-1 fusion 
protein, was dissolved in 5 ml of 6 M urea in 0.02 M Tris HCI, pH 8.0, 1 mM dithiothreitol (Buffer A), and 
was applied to a column of Q-Sepharose Fast Flow equilibrated with Buffer A. Polypeptides were eluted 

30 with a linear gradient of 0.0 to 0.3 M NaCI in Buffer A. After elution, fractions were analyzed by 
polyacryiamide gel electrophoresis in the presence of SDS to determine their content of SOD-NANB5-1-1. 
Fractions containing this polypeptide were pooled, and dialyzed against 6 M urea in 0.02 M sodium 
phosphate buffer, pH 6.0, 1 mM dithiothreitol (Buffer B). The dialyzed sample was applied on a column of 
S-Sepharose Fast Flow equilibrated with Buffer B, and polypeptides eluted with a linear gradient of 0.0 to 

35 0.3 M NaCI in Buffer B. The fractions were analyzed by polyacryiamide gel electrophoresis for the presence 
of SOD-NANBs-1 -1 , and the appropriate fractions were pooled. 

The final preparation of SOD-NANB5-1-1 polypeptide was examined by electrophoresis on 
polyacryiamide gels in the presence of SDS. Based upon this analysis, the preparation was more than 80% 
pure. 

40 

IV.D.2. Purification of Fusion Polypeptide SOD-NANB ai. 

The fusion polypeptide SOD-NANB 8 i, expressed in recombinant bacteria as described in Section 
IV.B.2., was purified from recombinant E. coli by differential extraction of the cell extracts with urea, followed 
45 by chromatography on anion and catiorTexchange columns utilizing the procedure described for the 
isolation of fusion polypeptide SOD-NANB5-1-1 (See Section IV.D.1.). 

The final preparation of SOD-NANBsi polypeptide was examined by electrophoresis on polyacryiamide 
gels in the presence of SDS. Based upon this analysis, the preparation was more than 50% pure. 

50 IV.D.3. Detection of Antibodies to HCV Epitopes by Solid Phase Radioimmunoassay . 

Serum samples from 32 patients who were diagnosed as having NANBH were analyzed by radioim- 
munoassay (RIA) to determine whether antibodies to HCV epitopes present in fusion polypeptides SOD- 
NANB5-1-1 and SOD-NANBsi were detected. 
55 Microtiter plates were coated with SOD-NANB5-1-1 or SOD-NANB 8 i, which had been partially purified 
according to Sections IV.D.1. and IV.D.2., respectively. The assays were conducted as follows. 

One hundred microliter aliquots containing 0.1 to 0.5 micrograms of SOD-NANB5-1-1 or SOD-NANBb 1 
in 0.125 M Na borate buffer, pH 8.3, 0.075 M NaCI (BBS) was added to each well of a microtiter plate 
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(Dynatech Immulon 2 Removawell Strips). The plate was incubated at 4 *C overnight in a humid chamber, 
after which, the protein solution was removed and the wells washed 3 times with BBS containing 0.02% 
Triton X-100 (BBST). To prevent non-specific binding, the wells were coated with bovine serum albumin 
(BSA) by addition of 100 microliters of a 5 mg/ml solution of BSA in BBS followed by incubation at room 

5 temperature for 1 hour; after this incubation the BSA solution was removed. The polypeptides in the coated 
wells were reacted with serum by adding 100 microliters of serum samples diluted 1:100 in 0.01 M Na 
phosphate buffer, pH 7.2, 0.15 M NaCI (PBS) containing 10 mg/ml BSA, and incubating the serum 
containing wells for 1 hr at 37 *C. After incubation, the serum samples were removed by aspiration, and the 
wells were washed 5 times with BBST. Anti-NANBs-i-i and Anti-NANBa 1 bound to the fusion polypeptides 

io was determined by the binding of 12S l-labeled P(ab>2 sheep anti-human IgG to the coated wells. Aliquots of 
100 microliters of the labeled probe (specific activity 5-20 microcuries/microgram) were added to each well, 
and the plates were incubated at 37* C for 1 hour, followed by removal of excess probe by aspiration, and 5 
washes with BBST. The amount of radioactivity bound in each well was determined by counting in a counter 
which detects gamma radiation. 

15 The results of the detection of anti-NANBs-i-i and anti-NANB 8 i in individuals with NANBH is 
presented in Table 1 . 
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Table 1 

Detection of Anti-5-1-1 and Anti~81 in Sera of 
NANB, HAV and HBV Hepatitis Patients 
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Patient 
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Patient 

weference S/N 
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1. 
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20 


Sequential serum samples available from these patients 
^ TVD-Intravenus Drug User 
3 AVH»Acute viral hepatitis 
4 PT*Post transfusion 

25 . 

As seen in Table 1, 19 of 32 sera from patients diagnosed as having NANBH were positive with respect 
to antibodies directed against HCV epitopes present in SOD-NANB5-1 -1 and S0D-NANB B r. 

However, the serum samples which were positive were not equally immunologically reactive with SOD- 

30 NANB5-1-1 and SOD-NANB 8 i. Serum samples from patient No. 1 were positive to SOD-NAN Bs 1 but not to. 
SOD-NANB5-1-1. Serum samples from patients number 10, 15, and 17 were positive to SOD- NAN Bs -1-1 
but not to SOD-NANBsi. Serum samples from patients No. 3, 8, 11, and 12 reacted equally with both fusion 
polypeptides, whereas serum samples from patients No. 2, 4, 7, and 9 were 2-3 fold higher in the reaction 
to SOD-NANB5-1-1 than to SOD-NANB 8 i. These results suggest that NANB5-1-1 and NANB 8 i may 

35 contain at least 3 different epitopes; i.e., it is possible that each polypeptide contains at least 1 unique 
epitope, and that the two polypeptides share at least 1 epitope. 

IV.D.4. Specificity of the Solid Phase RIA for NANBH 

40 The specificity of the solid phase RIAs for NANBH was; tested by using the assay on serum from 
patients infected with HAV or with HBV and on sera from control individuals. The assays utilizing partially 
purified SOD-NAN B5-1 -1 and SOD- NAN B 8 1 were conducted essentially as described in Section IV.D.3, 
except that the sera was from patients previously diagnosed as having HAV or HBV, or from individuals who 
were blood bank donors. The results for sera from HAV and HBV infected patients are presented in table 1. 

45 The RIA was tested using 11 serum specimens from HAV infected patients, and 20 serum specimens from •' 
HBV infected patients. As shown in table 1, none of these sera yielded a positive immunological reaction 
with the fusion polypeptides containing BB-NANBV epitopes. 

The RIA using the NANB5-1-1 antigen was used to determine immunological reactivity of serum from 
control individuals. Out of 230 serum samples obtained from the normal blood donor population, only 2 

50 yielded positive reactions in the RIA (data not shown). It is possible that the two blood donors from whom 
these serum samples originated had previously been exposed to HCV. 

IV.D.5. Reactivity of NANB5-1-1 During the Course of NANBH Infection . 

55 The presence of anti-NANB 5 -i-i antibodies during the course of NANBH infection of 2 patients and 4 
chimpanzees was followed using RIA as described in Section IV.D.3. In addition the RIA was used to ; 
determine the presence or absence of anti-NANBs-i-i antibodies during the course of infection of HAV and 
HBV in infected chimpanzees. 
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The results, which are presented in Table 2, show that with chimpanzees and with humans, anti- 
NANBs-i-i antibodies were detected following the onset of the acute phase of NANBH infection. Anti- 
NANBs-i -1 antibodies were not detected in serum samples from chimpanzees infected with either HAV or 
HBV. Thus anti-NANBs-1-1 antibodies serve as a marker for an individual's exposure to HCV. 

Table 2 
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Patient/ Sample Date (Days) Hepatitis Anti-5-1-1 ALT 
Chimo 7 (o^inoculation day) Viruses (S/N) (mu/ml) 

1.17 7 

1.60 83 

1.55 5 
1.60 

0.77 15 

0.98 130 

1.77 8 

1.27 5 

1.74 

3.29 9 

2.77 6 

2.35 8 

2.74 96-156 (pool) 

2.05 9 

1.78 13 

1.82 11 

1.26 8-100 (pool) 
9 

0.52 10 


30 IV.E. Purification of Polyclonal Serum Antibodies to NANB5-1-1 

On the basis of the specific immunological reactivity of the SOD-NANB5-1-1 polypeptide with the 
antibodies in serum samples from patients with NANBH, a method was developed to purify serum 
antibodies which react immunologically with the epitope(s) in NANB5-1-1. This method utilizes affinity 

35 chromatography. Purified SOD-NANB5-1 -1 polypeptide (see Section IV.D.1) was attached to an insoluble 
support; the attachment is such that the immobilized polypeptide retains its affinity for antibody to 
NANB5-1-1. Antibody in serum samples is absorbed to the matrix-bound polypeptide. After washing to 
remove non-specifically bound materials and unbound materials, the bound antibody is released from the 
bound SOD-HCV polypeptide by change in pH, and/or by chaotropic reagents, for example, urea. 

40 Nitrocellulose membranes containing bound SOD-NANB5-1-1 were prepared as follows. A nitrocel- 
lulose membrane, 2.1 cm Sartorius of 0.2 micron pore size, was washed for 3 minutes three times with 
BBS. SOD-NANB5-1-1 was bound to the membrane by incubation of the purified preparation in BBS at 
room temperature for 2 hours; alternatively it was incubated at 4°C overnight. The solution containing 
unbound antigen was removed, and the filter was washed three times with BBS for three minutes per wash. 

45 The remaining active sites on the membrane were blocked with BSA by incubation with a 5 mg/ml BSA 
solution for 30 minutes. Excess BSA was removed by washing the membrane with 5 times with BBS and 3 
times with distilled water. The membrane containing the viral antigen and BSA was then treated with 0.05 M 
glycine hydrochloride, pH 2.5, 0.10 M NaCI (GlyHCI) for 15 minutes, followed by 3 three minute washes 
with PBS. 

50 Polyclonal anti-NANB s -i -1 antibodies were isolated by incubating the membranes containing the fusion 
polypeptide with serum from an individual with NANBH for 2 hours. After the incubation, the filters were 
washed 5 times with BBS, and twice with distilled water. Bound antibodies were then eluted from each filter 
with 5 elutions of GlyHCI, at 3 minutes per elution. The pH of the eluates was adjusted to pH 8.0 by 
collecting each eluate in a test tube containing 2.0 M Tris HCI, pH 8.0. Recovery of the anti-NANBs-i-i 

55 antibody after affinity chromatography is approximately 50%. 

The nitrocellulose membranes containing the bound viral antigen can be used several times without 
appreciable decrease in binding capacity. To reuse the membranes, after the antibodies have been eluted 
the membranes are washed with BBS three times for 3 minutes. They are then stored in BBS at 4 # C. 
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IV.F. The Capture of HCV Particles from Infected Plasma Using Purified Human Polyclonal Anti-HCV 
Antibodies; Hybridization of the Nucleic Acid in the Captured Particles to HCV cDNA 

IV.F.1 . The Capture of HCV Particles from Infected Plasma Using Human Polyclonal Anti-HCV Antibodies 

5 

Protein-nucleic acid complexes present in infectious plasma of a chimpanzee with NANBH were isolated 
using purified human polyclonal anti-HCV antibodies which were bound to polystyrene beads. 

Polyclonal anti-NANBs-i-i antibodies were purified from serum from a human with NANBH using the 
SOD-HCV polypeptide encoded in clone 5-1-1. The method for purification was that described in Section 
w IV.E. 

The purified anti-NANBs-i-i antibodies were bound to polystyrene beads (1/4" diameter, specular 
finish, Precision Plastic Ball Co., Chicago, Illinois) by incubating each at room temperature overnight with 1 
ml of antibodies (1 microgram/ml in borate buffered saline, pH 8.5). Following the overnight incubation, the 
beads were washed once with TBST [50 mM Tris HCI, pH 8.0, 150 mM NaCI, 0.05% (v/v) Tween 20], and 
75 then with phosphate buffered saline (PBS) containing 10 mg/ml BSA. 

Control beads were prepared in an identical fashion, except that the purified anti-NANBs-i -i antibodies 
were replaced with total human immunoglobulin. 

Capture of HCV from NANBH infected chimpanzee plasma using the anti-NANBs-1-1 antibodies bound 
to beads was accomplished as follows. The plasma from a chimpanzee with NANBH used is described in: : 
20 Section IV.A.1.. An aliquot (1 ml) of the NAN By infected chimpanzee plasma was incubated for 3 hours at 
37 'C with each of 5 beads coated with either anti-NANBs-i-r antibodies, or wjth control immunoglobulins. 
The beads were washed 3 times with TBST. 

IV.F.2. Hybridization of the Nucleic Acid in the Captured Particles to NANBV-cDNA 

The nucleic acid component released from the particles captured with anti-NANBs-i-i antibodies was 
analyzed for hybridization to HCV cDNA derived from clone 81 . 

HCV particles were captured from NANBH infected chimpanzee plasma, as described in IV.F.1. To 
release the nucleic acids from the particles, the washed beads were incubated for 60 min. at 37° C with 0.2 
30 ml per bead of a solution containing proteinase k (1 mg/ml), 10 mM Tris HCI, pH 7.5, 10 mM EDTA, 0.25% 
(w/v) SDS, 10 micrograms/ml soluble yeast . RNA, and the supernatant solution was removed. The 
supernatant was extracted with phenol and chloroform, and the nucleic acids precipitated with ethanol 
overnight at -20 *C. The nucleic acid precipitate was collected by centrifugation, dried; and dissolved in 50 
mM Hepes, pH 7.5. Duplicate aliquots of the soluble nucleic acids from the samples obtained from beads 
35 coated with anti-NANBs-i-i antibodies and with control beads containing total human immunoglobulin were 
filtered onto to nitrocellulose filters. The filters were hybridized with a 32 P-labeled, nick-translated probe 
made from the purified HCV cDNA fragment in clone 81. The methods for preparing the probe and for the 
hybridization are described in Section IV.C.1.. 

Autoradiographs of a probed filter containing the nucleic acids from particles captured by beads 
40 containing anti-NANBs-i -1 antibodies are shown in Fig. 40. The extract obtained using the anti-NAN.B5.-1 -1 
antibody (Ai,A2) gave clear hybridization signals relative to the control antibody extract (A3, At) and to 
control yeast RNA (Bi,B 2 ). Standards consisting of 1pg, 5pg, and 10pg of the purified, clone 81 cDNA 
fragment are shown in C1-3, respectively. 

These results demonstrate that the particles captured from NANBH plasma by anti-NANBs-i-i- 
45 antibodies contain nucleic acids which hybridize with HCV cDNA in clone 81 , and thus provide further 
: evidence that the cDNAs in these clones are derived from the etiologic agent for NANBH. 

IV.G. Immunological Reactivity of C1 00-3 with Purified Anti-NANBs -i -1 Antibodies 

50 The immunological reactivity of C1 00-3 fusion polypeptide with anti-NANBs-1-1 antibodies was deter- 
mined by a radioimmunoassay, in which the antigens which were bound to a solid phase were challenged 
with purified anti-NANB 5 -i -1 antibodies, and the antigen-antibody complex detected with 125 l-labeled sheep 
anti-human antibodies. The immunological reactivity of C100-3 polypeptide was compared with that of SOD- 
NANB5 -1-1 antigen. 

55 The fusion polypeptide C100-3 was synthesized and purified as described in Section IV.B.5. and in 
Section IV.B.6., respectively. The fusion polypeptide SOD-NANB5-1 -1 was synthesized and purified as 
described in Section IV.B.1. and in Section IV.D.1., respectively. Purified anti-NANBs-i -1 antibodies were 
obtained as described in Section IV.E. 
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One hundred microliter aliquots containing varying amounts of purified C100-3 antigen in 0.125M Na 
borate buffer, pH 8.3, 0.075M NaCI (BBS) was added to each well of a microtiter plate (Dynatech Immulon 2 
Removawell Strips). The plate was incubated at 4°C overnight in a humid chamber, after which, the protein 
solution was removed and the wells washed 3 times with BBS containing 0.02% Triton X-1 00 (BBST). To 

5 prevent non-specific binding, the wells were coated with BSA by addition of 100 microliters of a 5 mg/ml 
solution of BSA in BBS followed by incubation at room temperature for 1 hour, after which the excess BSA 
solution was removed. The polypeptides in the coated wells were reacted with purified anti-NANBs-i-i 
antibodies by adding 1 microgram antibody/well, and incubating the samples for 1 hr at 37 °C. After 
incubation, the excess solution was removed by aspiration, and the wells were washed 5 times with BBST. 

io Anti-NANBs-1-r bound to the fusion polypeptides was determined by the binding of 125 Habeied F'(ab)2 
sheep anti-human IgG to the coated wells. Aliquots of 100 microliters of the labeled probe (specific activity 
5-20 microcuries/microgram) were added to each well, and the plates were incubated at 37 *C for 1 hour, 
followed by removal of excess probe by aspiration, and 5 washes with BBST. The amount of radioactivity 
bound in each well was determined by counting in a counter which detects gamma radiation. 

75 The results of the immunological reactivity of CI00 with purified anti-NANBs-i -r as compared to that of 
NANB5-1-1 with the purified antibodies are shown in Table 3. 

Table 3 


Immunological Reactivity of C100-3 compared to NANB5-1-1 by Radioimmunoassay 

AG(ng) 

RIA (cpm/assay) 


400 

320 

240 

160 

60 

0 

NANB5-1-1 

7332 

6732 

4954 

4050 

3051 

57 

C100-3 

7450 

6985 

5920 

5593 

4096 

67 


The results in Table 3 show thatanti-NANBs-1-1 recognizes an epitope(s) in the C1 00 moiety of the 
C100-3 polypeptide. Thus NANB5-1-1 and C100 share a common epitope(s). The results suggest that the 
cDNA sequence encoding this NANBV epitope(s) is one which is present in both clone 5-1-1 and in clone 
81. 

IV.H. Characterization of HCV 

IV.H.1. Characterization of the Strandedness of the HCV Genome. 

The HCV genome was characterized with respect to its strandedness by isolating the nucleic acid 
fraction from particles captured on anti-NANBs-i-i antibody coated polystyrene beads, and determining 
whether the isolated nucleic acid hybridized with plus arid/or minus strands of HCV cDNA. 

Particles were captured from HCV infected chimpanzee plasma using polystyrene beads coated with 
immunopurified anti-NANBs-i-i antibody as described in Section 1V.F.1. The nucleic acid component of 
the particles was released using the method described in Section IV. F.2. Ahquots of the isolated genomic 
nucleic acid equivalent to 3 mis of high titer plasma were blotted onto nitrocellulose filters. As controls, 
aliquots of denatured HCV cDNA from clone 81 (2 picograms) was also blotted onto the same filters. The 
filters were probed with 32 P-labeled mixture of plus or mixture of minus strands of single stranded DNA 
cloned from HCV cDNAs; the cDNAs were excised from clones 40b, 81 , and 25c. 

The single stranded probes were obtained by excising the HCV cDNAs from clones 81, 40b, and 25c 
with EcoRI, and cloning the cDNA fragments in M13 vectors, mp18 and mp19 [Messing (1983)]. The M13 
clones were sequenced to determine whether they contained the plus or minus strands of DNA derived 
from the HCV cDNAs. Sequencing was by the dideoxychain termination method of Sanger et al. (1977). 

Each of a set of duplicate filters containing aliquots of the HCV genome isolated from the captured 
particles was hybridized with either plus or minus strand probes derived from the HCV cDNAs. Fig. 41 
shows the autoradiographs obtained from probing the NANBV genome with the mixture of probes derived 
from clones 81 , 40b t and 25c. This mixture was used to increase the sensitivity of the hybridization assay. 
The samples in panel I were hybridized with the plus strand probe mixture. The samples in panel II were 
probed by hybridization with the minus strand probe mixture. The composition of the samples in the panels 
of the immunoblot are presented in table 4. 
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Table 4 


lane 

A 

B 

1 

HCV genome 

* 

2 


* 

3 

• 

cDNA 81 

4 


cDNA 81 


" is an undescribed sample. 
70 

As seen from the results in Fig. 41 , only the minus strand DNA probe hybridizes with the isolated HCV 
genome. This result, in combination with the result showing that the genome is sensitive to RNase and not 
DNase (See Section IV.C.2.), suggests that the genome of NANBV is positive stranded RNA. 
75 These data, and data from other laboratories concerning the physicochemical properties of a putative 
NANBV(s), are consistent with the possibility that HCV is a member of the Flaviviridae. However, the 
possibility that HCV represents a new class of viral agent has not been eliminated. 

IV.H.2. Detection of Sequences in Captured Particles Which When Amplified by PCR Hybridize to HCV 
20 cDNA Derived from Clone 81 

The RNA in captured particles was obtained as described in Section IV.H.1. The analysis for sequences 
which hybridize to the HCV cDNA derived from clone 81 was carried out utilizing the PCR amplification 
procedure, as described in Section IV.C.3, except that the hybridization probe was a kinased oligonucleotide 
25 derived from the clone 81 cDNA sequence; The results showed that the amplified sequences hybridized 
with the clone 81 derived HCV cDNA probe. 

IV.H.3. H omology Between the Non-Structural Protein of Dengue Flavivirus (MNWWVD1) and the HCV 
P olypeptides Encoded by the Combined ORF of Clones 14i Through 39c 

30 

The combined HCV cDNAs of clones 14i through 39c contain one continuous ORF, as shown in Fig. 26. 
The polypeptide encoded therein was analyzed for sequence homology with the region of the non-structural 
polypeptide(s) in Dengue flavivirus (MNWVD1). The analysis used the Dayhoff protein data base, and was 
performed on a computer. The results are shown in Fig. 42, where the symbol (:) indicates an exact 

35 homology, and the symbol (.) indicates a conservative replacement in the sequence; the dashes indicate 
spaces inserted into the sequence to achieve the greatest homologies. As seen from the figure, there is 
significant homology between the sequence encoded in the HCV cDNA, and the non-structural polypeptide- 
(s) of Dengue virus. In addition to the homology shown in Fig. 42, analysis of the polypeptide segment 
encoded in a region towards the 3'-end of the cDNA also contained sequences which are homologous to 

40 sequences in the Dengue polymerase. Of consequence is the finding that the canonical Gly-Asp-Asp (GDD) 
sequence thought to be essential for RNA-dependent RNA polymerases is contained in the polypeptide 
encoded in HCV cDNA, in a location which is consistent with that in Dengue 2 virus. (Data not shown.) 

IV.H.4. HCV-DNA is Not Detectable in NANBH Infected Tissue 

45 

Two types of studies provide results suggesting that HCV-DNA is not detectable in tissue from an 
individual with NANBH. These results, in conjunction with those described in IV.C. and IV.H.1. and IV.H.2. 
provide evidence that HCV is not a DNA containing virus, and that its replication does not involve cDNA. 

so IV.HAa. Southern Blotting Procedure 

In order to determine whether NANBH infected chimpanzee liver contains detectable HCV-DNA (or 
HCV-cDNA), restriction enzyme fragments of DNA isolated from this source was Southern blotted, and the 
blots probed with 32 P-labeled HCV cDNA. The results showed that the labeled HCV cDNA did not hybridize 
55 to the blotted DNA from the infected chimpanzee liver. It also did not hybridize to control blotted DNA from 
normal chimpanzee liver. In contrast, in a positive control, a labeled probe of the beta-interferon gene 
hybridized strongly to Southern blots of restriction enzyme digested human placental DNA. These systems 
were designed to detect a single copy of the gene which was to be detected with the labeled probe. 
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DNAs were isolated from the livers of two chimpanzees with NANBH. Control DNAs were isolated from 
uninfected chimpanzee liver, and from human placentas. The procedure for extracting DNA was essentially 
according to Maniatis et al. (1982), and the DNA samples were treated with RNAse during the isolation 
procedure. 

5 Each DNA sample was treated with either EcoRI, Mbol, or Hindi (12 micrograms), according to the 
manufacturers directions. The digested DNAs were electrophoresed on 1% neutral agarose gels, Southern 
blotted onto nitrocellulose, and the blotted material hybridized with the appropriate nick-translated probe 
cDNA (3 x 10 6 cpm/ml of hybridization mix). The DNA from infected chimpanzee liver and normal liver were 
hybridized with 32 P-labeled HCV cDNA from clones 36 plus 81; the DNA from human placenta was 

w hybridized with 32 P-labeled DNA from the beta-interferon gene. After hybridization, the blots were washed 
under stringent conditions, i.e., with a solution containing 0.1 x SSC, 0.1% SDS, at 65 *C. 
The beta-interferon gene DNA was prepared as described by Houghton et al (1981). 

IV.HAb. Amplification by the PCR Technique 

In order to determine whether HCV- DNA could be detected in liver from chimpanzees with NANBH, 
DNA was isolated from the tissue, and subjected to the PCR amplification-detection technique using primers 
and probe polynucleotides derived from HCV cDNA from clone 81. Negative . controls were DNA samples 
isolated from uninfected HepG2 tissue culture cells, and from presumably uninfected human placenta. 

20 Positive controls were samples of the negative control DNAs to which a known relatively small amount (250 
molecules) of the HCV cDNA insert from clone 81 was added. 

In addition, to confirm that RNA fractions isolated from the same livers of chimpanzees with NANBH 
contained sequences complementary to the HCV-cDNA probe, the PCR amplification-detection system was 
also used on the isolated RNA samples. 

25 In the studies, the DNAs were isolated by the procedure described in Section IV.HAa, and RNAs were 
extracted essentially as described by Chirgwin et al. (1981). 

Samples of DNA were isolated from 2 infected chimpanzee livers,, from uninfected HepG2 cells, and 
from human placenta. One microgram of each DNA was digested with Hindlll according to the manufac- 
turer's directions. The digested samples were subjected to PCR amplification and detection for amplified 

30 HCV cDNA essentially as described in Section IV.C.3., except that the reverse transcriptase step was 
omitted. The PCR primers and probe were from HCV cDNA clone 81! and are described in Section IV.C.3.. 
Prior to the amplification, for positive controls, a one microgram sample of each DNA was "spiked" by the 
addition of 250 molecules of HCV cDNA insert isolated from clone 81 . 

In order to determine whether HCV sequences were present in RNA isolated from the livers of 

35 chimpanzees with NANBH, samples containing 0.4 micrograms of total RNA were subjected to the 
amplification procedure essentially as described in Section IV.C.3., except that the reverse transcriptase was 
omitted from some of the samples as a negative control. The PCR primers and probe were from HCV cDNA 
clone 81 , as described supra. 

The results showed that amplified sequences complementary to the HCV cDNA probe were not 

40 detectable in the DNAs from infected chimpanzee liver, nor were they : detectable in the negative controls. In 
contrast, when the samples, including the DNA from infected chimpanzee liver, was spiked with the HCV 
cDNA prior to amplification, the clone 81 sequences were detected in all positive control samples. In 
addition, in the RNA studies, amplified HCV cDNA clone 81 sequences were detected only when reverse 
transcriptase was used, suggesting strongly that the results were not due to a DNA contamination. 

45 These results show that hepatocytes from chimpanzees with NANBH contain no, or undetectable levels, 
of HCV DNA. Based upon the spiking study, if HCV DNA is present, it is at a level far below .06 copies per 
hepatocyte. In contrast, the HCV sequences in total RNA from the same liver samples was readily detected 
with the PCR technique. 

so I V.I. ELISA Determinations for HCV Infection Using HCV c100-3 As Test Antigen 

All samples were assayed using the HCV d 00-3 ELISA. This assay utilizes the HCV c1 00-3 antigen 
(which was synthesized and purified as described in Section IV.B.5), and a horseradish peroxidase (HRP) 
conjugate of mouse monoclonal anti-human IgG. 
55 Plates coated with the HCV c100-3 antigen were prepared as follows. A solution containing Coating 
buffer (50m M Na Borate, pH 9.0), 21 ml/plate, BSA (25 microg rams/ml), c100-3 (2.50 microg rams/ml) was 
prepared just prior to addition to the Removeawell Immulon I plates (Dynatech Corp.). After mixing for 5 
minutes, 0.2ml/weil of the solution was added to the plates, they were covered and incubated for 2 hours at 
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37 °C, after which the solution was removed by aspiration. The wells were washed once with 400 microliters 
Wash Buffer (100 mM sodium phosphate, pH 7.4, 140 mM sodium chloride, 0.1% (W/V) casein. 1% (W/V) > 
Triton x-100, 0.01% (W/V) Thimerosal). After removal of the wash solution, 200 microliters/well of Postcoat : 
solution (10 mM sodium phosphate, pH 7.2, 150 mM sodium chloride, 0.1% (w/v) casein and 2 mM 

5 phenylmethylsulfonylfluoride (PMSF)) was added, the plates were loosely covered to prevent evaporation, 
and were allowed to stand at room temperature for 30 minutes. The wells were then aspirated to remove the 
solution, and lyophilized dry overnight, without shelf heating. The prepared plates may be stored at 2-8*C 
in sealed aluminum pouches. 

In order to perform the ELISA determination, 20 microliters of serum sample or control sample was 

io added to a well containing 200 microliters of sample diluent (100 mM sodium phosphate, pH 7.4, 500 mM 
sodium chloride, 1 mM EDTA, 0.1% (W/V) Casein, 0.015 (W/V) Therosal, 1% (W/V) Triton X-100, 100 
micrograms/ml yeast extract). The plates were sealed, and incubated at 37 *C for two hours, after which the 
solution was removed by aspiration, and the wells were washed with 400 microliters of wash buffer 
(phosphate buffered saline (PBS) containing 0.05% Tween 20). The washed wells were/treated with 200 

75 microliters of mouse anti-human IgG-HRP conjugate contained in a solution of Ortho conjugate diluent (10 
mM sodium phosphate, pH 7.2, 150 mM sodium chloride, 50% (V/V) fetal bovine serum, 1% (V/V) heat 
treated horse serum, 1 mM KjFe (CN) 6 , 0.05% (WA/) Tween 20, 0.02% (WA/) Thimerosal). Treatment was 
for 1 hour at 37 *C, the solution was removed by aspiration, and the wells were washed with wash buffer, 
which was also removed by aspiration. To determine the amount of bound enzyme conjugate, 200 

20 microliters of substrate solution (10 mg O-phenylenediamine dihydrochloride per 5 ml of Developer solution) 
was added. Developer solution contains 50 mM sodium citrate adjusted to pH 5.1 with phosphoric acid, and 
0.6 microliters/ml of 30% H2O2. The plates containing the substrate solution were incubated in the dark for 
30 minutes at room temperature, the reactions were stopped by the addition of 50 microliters/ml 4N sulfuric 
acid, and the ODs determined. 

25 The examples provided below show that the microtiter plate screening ELISA which utilizes HCV c100-3 
antigen has a high degree of specificity, as evidenced by an initial rate of reactivity of about 1%, with a 
repeat reactive rate of about 0.5% on random donors. The assay is capable of detecting an im- 
munoresponse in both the post acute phase of the infection, and during the chronic phase of the disease. In 
addition, the assay is capable of detecting some samples which score negative in the surrogate tests for 

30 NANBH; these samples come from individuals with a history of NANBH, or from donors implicated in-! ; 
NANBH transmission. 

In the examples described below, the following abbreviations are used: 



ALT 

Alanine amino transferase 


Anti-HBc 

Antibody against HBc 

35 

Anti-HBsAg 

Antibody against HBsAg 


HBc 

Hepatitis B core antigen 


ABsAg 

Hepatitis B surface antigen 


igG 

Immunoglobulin G 


igM 

Immunoglobulin M 

40 

IU/L 

International units/Liter 


NA 

Not available 


NT 

Not tested 


N 

Neg 

Sample size 
Negative 

45 

OD 

Optical density 


Pos 

Positive 


s/co 

Signal/cutoff 


SD 

Standard deviation 


X 

Average or mean 

50 

WNL 

Within normal limits 


IV.1.1. HCV Infection in a Population of Random Blood Donors 

A group of 1,056 samples (fresh sera) from random blood donors were obtained from Irwin Memorial 
55 Blood Bank, San Francisco, California. The test results obtained with these samples are summarized in a 
histogram showing the distribution of the OD values (Fig. 43). As seen in Fig. 43, 4 samples read >3, 1 
sample reads between 1 and 3, 5 samples read between 0.4 and 1, and the remaining 1,046 samples read 
<0.4, with over 90% of these samples reading <0.1. 
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The results on the reactive random samples are presented in Table 5. Using a cut-off value equal to the 
mean plus 5 standard deviations, ten samples out of the 1 ,056 (0.95%) were initially reactive. Of these, five 
samples (0.47%) repeated as reactive when they were assayed a second time using the ELISA. Table 5 
also shows the ALT and Anti-HBd status for each of the repeatedly reactive samples. Of particular interest 
s is the fact that all five repeat reactive samples were negative in both surrogate tests for NANBH, while 
scoring positive in the HC V ELISA. 

TABLE 5 

w RESULTS ON REACTIVE RANDOM SAMPLES 


15 N * 1051 

i « 0.049* 
SD ■ ♦ 0.074 

Cut-cff: 5 ♦ 5SD * 0.419 (0.400 ♦ negative Control) 


20 




Initial 






Reectives 

Repeat Reactive* 


Anti 


Samples 

OD 

OD 

ALT** 

nu/D 

HBc*** 

Tod) 

25 

4227 

0.462 

0.084 

NA 

NA 


6292 

0.569 

0.294 

MA 

NA 


6188 

0.699 

0.326 

MA 

NA 


6157 

0.735 

0.187 

MA 

NA 


6277 

0.883 

0.152 

MA 

NA 

30 

6397 

1.567 

1.392 

30.14 

1.433 


6019 

>3.000 

>3.000 

46.48 

1.057 


6651 

>3.000 

>3.000 

48.53 

1.343 


6669 

>3.000 

>3.000 

60.53 

1.165 


4003 

>3.000 

3.000 

WML**** 

Negative 

35 


10/1056 > 0.95Z 

5/1056 - 0.471 




* Samples reading >1.5 were not included in calculating the Keen 
end SD 

40 ++ ALT £ 68 IU/L is ebove no roe 1 limits. 

*** Anti-HBc < 0.535 (competition assay) is considered positive. 
'****. WNL: Within normal limits. 


45 

IV.1.2. Chimpanzee Serum Samples 

Serum samples from eleven chimpanzees were tested with the HCV c100-3 ELISA. Four of these 
chimpanzees were infected with NANBH from a contaminated batch of Factor VIII (presumably Hutchinson 

so strain), following an established procedure in a collaboration with Dr. Daniel Bradley at the Centers for 
Disease Control. As controls, four other chimpanzees were infected with HAV and three with HBV. Serum 
samples were obtained at different times after infection. 

The results, which are summarized in Table 6, show documented antibody seroconversion in all 
chimpanzees infected with the Hutchinson strain of NANBH. Following the acute phase of infection (as 

55 evidenced by the significant rise and subsequent return to normal of ALT levels), antibodies to HCV c100-3 
became detectable in the sera of the 4/4 NANBH infected chimpanzees. These samples had previously 
been shown, as discussed in Section IV.B.3., to be positive by a Western analysis, and an RIA. In contrast, 
none of the control chimpanzees which had been infected with HAV or HBV showed evidence of reactivity 
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in the ELISA. 


10 


15 


20 


Negative Control 
Positive Control 
Cutoff 

Chimp 1 


Chimp 2 


25 Chimp 3 


30 


35 


Chimp 4 


Chimp 5 


4 0 Chimp 6 


45 


Chimp 7 


TABLE 6 
OOMPANZEE SERUM SAMPLES 




Inoculation 

Bleed 

ALT 




Date 

Date 

(iu/p 

Tpamsf USE 0 

o.oot 





1.501 






0. 101 






•0.007 

0.00 

05/2*781 

05/21/81 

v 9 

. NANB 

0.003 

0.01 


08/07/81 

71 


>3.000 

>7,18 


09/18/81 

19 


>3.UQU 








06/07/81 

••' ' ™ • 


NANB ■ 

-0.003 

0.00 


05/31/81 

- 5 


-U.005 

0.00 


06/28/81 

52 


t\ SAC ' 

0 v. 





»3.0U0 

>7.18 


10/21/81 



T).TO5 

0.01 

03/11/85 

03/11/85 

8 

' NANB 

0.017 

0.01 


01/26/85 

205 


n Mr 

n ni 

U.UI 


05/06/85 

11 


1. 010 

2.52 


08/20/85 

6 


-0.006 

0.00 

03/11/85 

03/11/85 

11 

urn 

0.003 

0.01 


05/09/85 

132 


0.523 

1.31 


06/06/85 



1.571 

3.93 


08/01/85 



-0.006 

0.00 

11/21/80 

11/21/80 

1 

HAV 

0.001 

0.00 


12/16/80 

117 


0.003 

0.01 


12/30/80 

18 


0.006 

0.01 


07/29 - 08/21/81 

5 




05/25/82 



: . HAV 

-0.005 

0.00 


05/17/82 



0.001 

0.00 


06/10/82 

106 


-0.001 

0.00 


07/06/82 

10 


0.290 

0.72 


10/01/82 



-0*008 

0.00 

05/25/82 

05/25/82 

7 

■ HAV , 

-0.001 

0.00 


06/17/82 

83 


-0.006 

0.00 


09/16/82 

5 


0.005 

0.01 


10/09/82 




50 
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TABLE 6 


CHIMPANZEE SERUM SAMPLES 


(Cont'd) 


10 


15 


20 


25 


30 


Chimp 8 


Chimp 9 


Chimp 10 


Chimp 11 


0D 

-0.007 
0.000 
0.001 
0,000 


0.019 

0.015 
0,008 


0.011 
0.015 
0.008 
0.010 


0.000 

-0.003 
-0.003 
-0.003 


S/CO 

0.00 
0.00 
0.01 
0.00 


0.05 

0.01 
0.02 


0.03 
0.01 
0.0? 
0.02 


0.00 

0.00 
0.00 
0.00 


Imucuiatiom 
Date 

11/21/20 


07/21/80 


Bleed 
Date 


05/12/82 


05/12/82 


11/21/80 
12/16/80 
02/03/81 
06/03 - 06/10/81 


08/22 - 10/10/79 

03/11/81 
07/01 - 08/05/81 

10/01/81 


01/21 - 05/12/82 
09/01 - 09/08/82 

12/02/82 

01/06/83 


01/06 - 05/12/82 
06/23/82 

06/09 - 07/07/82 
10/28/82 
12/20/82 



57 
9 
6 


9 
126 
9 
13 


11 

100 

9 
10 


TRANSFUSED 

IIAV 
HBV 


I1DV 


IIDV 


35 


40 


45 


50 


55 


IV.1.3. Panel 1: Proven Infectious Sera from Chronic Human NANBH Carriers 

A coded panel consisted of 22 unique samples, each one in duplicate, for a total of 44 samples.. The 
samples were from proven infectious sera from chronic NANBH carriers, infectious sera from implicated 
donors, and infectious sera from acute phase NANBH patients: In addition, the samples were from highly 
pedigreed negative controls, and other disease controls. This panel was provided by Dr. H. Alter of the 
Department of Health and Human Services, National Institutes of Health, Bethesda, Maryland. The panel 
was constructed by Dr. Alter several years ago, and has been used by Dr. Alter as a qualifying panel for 
putative NANBH assays. 

The entire panel was assayed twice with the ELISA assay, and the results were sent to Dr. Alter to be 
scored. The results of the scoring are shown in Table 7. Although the Table reports the results of only one 
set of duplicates, the same values were obtained for each of the duplicate samples. 

As shown in Table 7, 6 sera which were proven infectious in a chimpanzee model were strongly 
positive. The seventh infectious serum corresponded to a sample for an acute NANBH case, and was not 
reactive in this ELISA. A sample from an implicated donor with both normal ALT levels and equivocal results 
in the chimpanzee studies was non-reactive in the assay. Three other serial samples from one individual 
with acute NANBH were also non-reactive. All samples coming from the highly pedigreed negative controls, 
obtained from donors who had at least 10 blood donations without hepatitis implication, were non-reactive in 
the ELISA. Finally, four of the samples tested had previously scored as positive in putative NANBH assays 
developed by others, but these assays were not confirmable. These four samples scored negatively with the 
HCV ELISA. 
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TABLE 7 
H. ALTER'S PANEL 1; 

5 


__ Panel _ 1st Result 2nd Result 

io 1) Proven Infectious iy Giimpanzee Transmission 
A, CimoMic NANB; Post-Tx 

JF ♦ 
ED 

PG ♦ ♦ 

75 B. Implicated Donors with Elevateo ALT 

BC : ♦ 

JJ ♦ ♦ 

00 > ♦ 

C. Acute NANB; Post-Tx 

20 VII - - 

2) Equivocally Infectious it Chimpanzee Transmission 
A. Implicated Donor with Normal ALT 

CC - 

3) Acute NAM): Post-Tx 

25 JL Week 1 - - 

JL Week 2 

JL Week 3 - - 

t\) Disease Controls 

A. Primary Biliary Cirrhosis 
30 ek - : • 

0. Alcoholic Hepatitis in Recovery 

110 - 

5) Pedigreed Negative Controls 

DM - : - 

35 DC 

LV . - 

HL 

All - - ' 

6) Potential NANO 'Antigens* 

40 JS-80-01T-0 (Isiiida) " " 

ASTERIX (TREPO) ~ ' 

zurtz (arn0u0) " ' 

Becassdime (Trcpo) 

45 

IV.1.4. Panel 2: Donor/Recipient NANBH 

The coded panel consisted of 10 unequivocal donor-recipient cases of transfusion associated NANBH, 
50 with a total of 188 samples. Each case consisted of samples of some or all the donors to the recipient, and 
of serial samples (drawn 3, 6, and 12 months after transfusion) from the recipient. Also included was a pre- 
bleed, drawn from the recipient before transfusion. The coded panel was provided by Dr. H. Alter, from the 
NIH, and the results were sent to him for scoring. 

The results, which are summarized in Table 8 t show that the ELISA detected antibody seroconversion in 
55 9 of 10 cases of transfusion associated NANBH. Samples from case 4 (where no seroconversion was 
detected), consistently reacted poorly in the ELISA. Two of the 10 recipient samples were reactive at 3 
months post transfusion. At six months, 8 recipient samples were reactive; and at twelve months, with the 
exception of case 4, all samples were reactive. In addition, at least one antibody positive donor was found in 
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7 out of the 10 cases, with case 10 having two positive donors. Also, in case 10, the recipient's pre-bleed 
was positive for HCV antibodies. The one month bleed from this recipient dropped to borderline reactive 
levels, while it was elevated to positive at 4 and 10 month bleeds. Generally, a S/CO of 0.4 is considered 
positive. Thus, this case may represent a prior infection of the individual with HCV. 
5 The ALT and HBc status for all the reactive, i.e., positive, samples are summarized in Table 9. As seen 
in the table, 1/8 donor samples was negative for the surrogate markers and reactive in the HCV antibody 
ELISA. On the other hand, the recipient samples (followed up to 12 months after transfusion) had either 
elevated ALT, positive Anti-HBc, or both. 

w 

TA3TJE 8 
DONOR/RECIPiaJT NAN3 PANEL 


?5 


20 


25 


H. ALTER DONpg/BFriPlfHT HANB PAKEL 


(Use Po"Q» 

00 S/ffl. 

1. — " 

2 . 

3. -403 0.94 

30 

* v ™ 

5. »3.000 >6.96 

6. »3.000 >6.96 

7. >3.000 >6.96 
35 8. >3.000 >6.96 

9. >3.000 >6.96 

10. >3.000 >6.96 
>3.000 >6.96 

40 

' i KOHTH, 4 M>«TKS. ** # 10 NOKTKS 


45 


Recipient 

P»cai rrn 

1 Months 

fi' Months 

12 Mouths 


s/co 


S/Cfl 


S/CO 

-J2L- 

S/CO 

.032 

0.07 

.112 

0.26 

>3.000 

>6.96 

>3.000 

>6.96 

.059 

0.14 

•050 

6.12 

1,681 

3.90 

>3.000 

>6.96 

.009 

o.u 

.057 

0.13 

>3.000 

>6.96 

>3.000 :: 

>6.9o . 

.065 

0.15 

.073 

0.17 

.067 

0.16 

.217 

0.50 

.034 

0.08 

.096 

0.22 

»3.000 

>6.96 

>3.000 

>6.96 

.056 

0.13 

1.175 

3.44 

: >3.000 

>6.96 

>3.000 

>6.96 

.030 

0.08 

.056 

0.13 

>3.000 

►6,96 

>3.000 

>6.96 

.061 

0.14 

.078 

0.18 

2.262 

5.28 

>3.000 

>6.96 

.080 

0.19 

.127 

0.30 

.055 

0.13 

>3.000 

>6.96 

>3.qoo 

>6.96 

.317* 

0.74 

»3.000" 

>6.96 

>3.000"* 

»6.96 
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TABLE * 

ALT AND HBc STATUS FOR REACTIVE SAMPLES IN 
H. ALTER PANEL 1 


70 


75 


20 


25 


30 


35 


40 


45 


50 


Samples 


Donors 


Case 
Case 
Case 
Case 
Case 8 
Case 9 
Case 
Case 


3 
5 
6 
7 


10 
10 


Recipients 


Anti- 
ALT* 


Normal 

Elevated 

Elevated 

Not available 

Normal 

Elevated 

Normal 

Normal 


HBc* 


Negative 

Positive 

Positive 

Negative 

Positive 

Not available 

Positive 

Positive 


Case 

1 


6 mo 

Elevated 

Positive 


12 

mo 

Elevated 

Not tested 


Case 

2 


6 mo 

Elevated 

Negative 


12 

mo 

Elevated 

Not tested 


Case 

3 


6 DO 

Normal 

Not tested*** 


12 

mo 

Elevated :' 

Not tested*** 


Case 

5 


6 mo 

Elevated 

Not tested 


* 2 

mo 

Elevated 

Not tested 


Case 

6 


3 mo 

Elevated 

Negative 


6 

mo 

Elevated 

Negative 



12 

mo 

Elevated 

Not tested 


Case 

7 


6 DO 

Elevated 

Negative 


12 

mo 

Elevated 

Negative 


Case 

8 


6 mo 

Normal 

Positive 


12 

mo 

Elevated 

Not tested 


Case 

9 


12 mo 

Elevated 

Not tested 

Case 

10 


4 mo 

Elevated 

Not tested 


10 

mo 

Elevated 

Not tested 



* ALT IU/L is above normal limits* 
** Anti-HBc $501 (competition assay) is considered positive, 
*** Prebieed and 3 mo samples were negative for HBc • 


55 IV.1.5. Determination of HCV Infection in High Risk Group Samples 

Samples from high risk groups were monitored using the ELISA to determine reactivity to HCV c1 00-3 
antigen. These samples were obtained from Dr. Gary Tegtmeier, Community Blood Bank, Kansas City. The 
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results are summarized in Table 10. 

As shown in the table, the samples with the highest reactivity are obtained from hemophiliacs (76%). In 
addition, samples from individuals with elevated ALT and positive for Anti-HBc, scored 51% reactive, a 
value which is consistent with the value expected from clinical data and NANBH prevalence in this group. 
5 The incidence of antibody to HCV was also higher in blood donors with elevated ALT alone, blood donors 
positive for antibodies to Hepatitis B core alone, and in blood donors rejected for reasons other than high 
ALT or anti-core antibody when compared to random volunteer donors. 

TABLE 10 


35 


40 


NANBH HIGH RISK GROUP SAMPLES 

Group 

N 

Distribution 

% Reactive 

N 

on 

elevated al i 


n 79Q 

3 

>3.000 

1 1 .4% 

Anti-HBc 


24 

c 
O 


20.8% 

Elevated ALT, Anti-HBc 


OO 


■^o nnn 
>o.UUU 

■ O I .O /o 


1 

2.768 





1 

2.324 





1 

0.939 





1 

0.951 





1 

0.906 




Rejected Donors 


25 

5 

>3.000 

20.0% 

Donors with History of Hepatitis 


150 

19 

>3.000 

14.7% 


1 

0.837 





1 

0.714 





1 

0.469 




Haemophiliacs 


50 

31 

>3.000 

76.0% 


1 

2.568 





1 

2.483 





1 

2.000 





1 

1.979 





1 

1.495 





1 

1.209 





1 

0.819 





IV.I.6 Comparative Studies U sing Anti-IgG or Anti-IgM Monoclonal Antibodies, or Polyclonal Antibodies as a 
Second Antibody in the HCV c100-3 ELISA 

45 

The sensitivity of the ELISA determination which uses the anti-IgG monoclonal conjugate was compared 
to that obtained by using either an anti-IgM monoclonal conjugate, or by replacing both with a polyclonal 
antiserum reported to be both heavy and light chain specific. The following studies were performed. 

so IV.I.6.a. Serial Samples from Seroconverters 

Serial samples from three cases of NANB seroconverters were studied in the HCV c1 00-3 ELISA assay 
using in the enzyme conjugate either the anti-IgG monoclonal alone, or in combination with an anti-IgM 
monoclonal, or using a polyclonal antiserum. The samples were provided by Dr. Cladd Stevens, N.Y. Blood 
55 Center, N.Y.C., N.Y.. The sample histories are shown in Table 11. 

The results obtained using an anti-IgG monoclonal antibody-enzyme conjugate are shown in Table 12. 
The data shows that strong reactivity is initially detected in samples 1-4, 2-8, and 3-5, of cases 1 , 2, and 3, 
respectively. 
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The results obtained using a combination of an anti-IgG monoclonal conjugate and an anti-IgM 
conjugate are shown in Table 13. Three different ratios of anti-IgG to anti-IgM were tested; the 1:10,000 
dilution of anti-IgG was constant throughout. Dilutions tested for the anti-IgM monoclonal conjugate were 
1:30,000, 1:60,000, and 1:120,000. The data shows that, in agreement with the studies with anti-IgG alone, 

5 initial strong reactivity is detected in samples 1-4, 2-8, and 3-5. 

The results obtained with the ELISA using anti-IgG monoclonal conjugate (1:10,000 dilution), or Tago 
polyclonal 'conjugate (1:80,000 dilution), or Jackson polyclonal conjugate (1:80,000 dilution) are shown in : 
Table 14. The data indicates that initial strong reactivity is detected in samples 1-4, 2-8, and 3-5 using all 
three configurations; the Tago polyclonal antibodies yielded the lowest signals. 

to The results presented above, show that all three configurations detect reactive samples at the same time 
after the acute phase of the disease (as evidenced by the ALT elevation). Moreover, the results indicate that 
the sensitivity of the HCV c100-3 ELISA using anti-IgG monoclonal-enzyme conjugate is equal to or better 
than that obtained using the other tested configurations for the enzyme conjugate. 


20 


25 



45 


55 
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TABLE 11 

DESCRIPTION OF SAMPLES FROM CLAD D STEVENS PANEL 

5 

0«c« HBiAg : Anci-HBt Ahci-HBc ALT Bilirupin 

10 1 


75 


25 


40 


45 


1-1 

J ' 

o/3/Bl 

1.0 

91.7 

12.9 

40.0 

-1.0 

1-2 

A /ft 1 

9/2/01 

1.0 

121.0 

13.1 

274.0 

.1.4 

1-3 

1U/7/01 

1*0 

44.0 

23.8 

261.0 

0.9 

1-4 

li/19/81 

1.0 

47.3 

33.8 

73.0 

0.9 

.1-5 

12715/81 

1,0 

30.3 

27.6 

71.0 

1.0 

Cw 2 







2-1 

lO/iy/Bi 

1.0 

1.0 

116.2 

17.0 

-1.0 

2-2 

1 1 /IT/11 
11/17/01 

1.0 

0.8 

89.3 

46.0 

•: 1. 1 

2-3 

1Z/U2/01 

1*0 

1.2 

78.3 

63.0 

1.4 

2-4 

12/14/01 

1.0 

0.9 

90.6 

132.0 

1.4 

2-5 

12/22/01 

1.0 

0.8 

93.6 

624.0 

1.7 

2-6 

1/20/02 

1.0 

0*8 

92.9 

66.0 

*-3 

^ /. 

2/19/02 

1.0 

0.8 

86.7 

70.0 

1.3 

"D — 0 

J/17/02 

1.0 

0.9 

69.8 

24.0 

-1.0 

2-9 

4/21/82 

1.0 

0.9 

67.1 

53.0 

1.5 

2-10 

3/19/82 

1.0 

0.3 

74.8 

95.0 

1.6 

2-11 

8/14/82 


0.8 

82.9 

: 37.0 

-1.0 

C.I. 3 







3-1 

4/7/81 

1.0 


88.4 

13.0 

-1.0 

3-2 

3/12/01 

1.0 

1.1 

126.2 

236.0 

0.4 

3-3 

3/30/81 

1.0 

0.7 

99.9 

471.0 

0.2 

3-4 

8/9/81 

1.0 

1.2 

110.8 

313.0 

0.4 

3-5 

7/4/01 

1.0 

1.1 

89.9 

273.0 

0.4 

3-6 

8/10/81 

1.0 

1.0 

118.2 

158.0 

0.4 

3-7 

9/8/81 

1.0 

1.0 

112.3 

84.0 

0.3 

3-8 

10/14/81 

1.0 

0.9 

102.3 

180.0 

0.5 

3-9 

11/11/81 

1.0 

1.0 

84.4 

134.0 

0.3 
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TABLE 12 

ELISA RESULTS OBTAINED USING AN ANTI-IgG MONOCLONAL CONJUGATE 


SAMPLE 

BAT? , 

fill 

OS 

s/co 

Neg Control 



.076 


Cutoff 



.476 


PC (1:128) 



1.390 







1-1 

08/05/81 

40.0 

.178 

.37 

1-2 

MM #M 4 ill • 

09/02/81 

274.0 

.154 

.32 

l-j 

10/07/81 

251.0 

t ^A 

.129 

.27 

1-4 

11/19/81 

75.0 

.937 

1.97 

t-5 

12/15/81 

71.0 

>3.000 

>6.30 

Cm n 





2-1 

10/19/81 

17.0 

.058 

0.12 

2-2 

11/17/81 • 

46.0 

.050 

0.11 

2-3 

12/02/81 

63.0 

.047 

0.10 

2-0 

12/14/81 

152.0 

.059 

0.12 

2-5 

12/23/81 

624.0 

.070 

0.15 

2-6 

01/20/82 

66.0 

.051 

0.11 

2-7 

02/15/82 

70.0 

.139 

0.29 

2-8 

03/17/82 

24.0 

1.867 

3.92 

2-9 

04/21/82 

53.0 

>3.000 

>6.30 

2-10 

05/19/82 

9S.0 

>3.000 

>6.30 

2-11 

06/14/82 

37.0 

>3.000 

>6.30 

C«H *3 





3-1 

04/07/81 

13.0 

.090 

.19 

3-2 

05/12/81 

236.0 

.064 

.13 

3-3 

05/30/81 

471.0 

.079 

.17 

3-* 

06/09/81 

315.0 

.211 

.44 

3-5 

07/06/81 

273.0 

1.707 

3.59 

3-6 

08/10/81 

1S8.0 

>3.000 

>6.30 

3-7 

09/08/81 

84.0 

>3.000 

>6.30 

3-8 

10/14/81 

180.0 

>3.000 

>6.30 

3-9 

ll/U/81 

154.0 

>3.000 

»6.30 
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TABLE 13 


ELISA RESULTS OBTAINED USING ANTI-IgG 


and ANTI-IgM 


MONOCLONAL CONJUGATE 


NANB EUSAs 





HOHOCIONALS 

HONOCLOWALS 

HONOCLONALS 




IcG 1:I0K 

IgG 1:10* 

IgG I:10K 




UM 1:30* 

IgN 1:60K 

IgM 1:120K 

SAMPLE 

DATE 

AH 

OS S£fl 


OS : ; 5i£Q 

MtS COMTROt 



.100 

.080 


Cutoff 






PC (1:128) 



1.083 

1.328 

1.197 

C»5f »\ 






l-l 

08/05/81 

40 

.173 

.162 

.070 

1-2 

09/02/81 

274 

.194 

: : :: .111 : 

.079 

1-3 

10/07/81 

261 

.162 

.129 

.063 

1-4 

11/19/81 

75 

.812 

.85 

.709 

1-5 

12/15/81 

71 

>3.00 

►3.00 

>3.00 


Case #2 


2-1 

10/19/81 

17 

.442 

.04S 

2-2 

11/17/81 

46 

.102 

.029 

2-3 

12/02/81 

63 

.059 

.036 

2-4 

12/M/81 

152 

.065 

.041 

2-5 

12/23/81 

624 

.082 

.033 

2-6 

01/20/82 

66 

.102 

.042 

2-7 

02/15/82 

70 

.188 

.068 

2-8 

03/17/82 

24 

1.728 

1.668 

2-9 

04/21/82 

53 

>3.00 

2.443 

2-10 

05/19/82 

95 

>3.00 

>3.00 

2-11 

06/14/82 

37 

>3.00 

>3.00 


.035 
.030 
.027 
.025 
.032 
.027 
.096 
1.541 
»3.00 
>3.00 
>3.00 


3-1 

04/07/81 

13 

.193 

.076 

.049 

3-2 

05/12/81 

236 

.201 

J0S1 

.038 

3-3 

05/30/81 

471 

.132 

.067 

: .052 

3-4 

06/09/81 

315 

.175 

.155 

.140 

3-5 

07/06/81 

273 

1.335 

1.238 

1.260 

3-6 

08/10/81 

158 

>3.00 

>3.00 

>3.00 

3-7 

09/08/81 

84 

>3.00 

»3.00 

■ : >3.00 

3-8 

10/14/81 

180 

>3.00 

>3.00 

>3.00 

3-9 

11/11/81 

154 

>3.00 

>3.00 

>3.00 
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TABLE 14 

ELISA RESULTS OBTAINED USING POLYCLONAL CONJUGATES 

"AWB £LISA« 
MONOCLONAL TAGO / JACKSON 
1:10K 1:80K t :80K 


SAHPLE 

m 

ALT 

_2P_ 

s/co 

jjp_ 

S/CO 

JL 

S/CO 

Ntc Control 


.076 


.015 


.151 


Cutoff 



.076 


.515 


.651 


PC (1:128) 



1.390 


J21 


2J5J 


c*« n 









i-i 

08/05/81 

40 

.178 

.37 

.067 

.12 

.IS3 

.23 

1-2 

09/02/81 

27 4 

.151 

.32 

.097 

.18 

.225 

.31 

1-3 

10/07/81 

261 

.129 

.27 

.026 

.05 

.167 

.26 

1-1 

11/19/81 

75 

.937 

1.97 

.321 

.60 

.793 

1.21 

1-5 

12/15/81 

71 

>3.00 

>6.30 

1.778 

3.27 

»3.00 

>4.59 


C*iil? 


2-1 

10/19/81 

17 

.058 

.12 

.023 

.01 

.052 

.08 

2-2 

11/17/81 

46 

.050 

.11 

.018 

.03 

.058 

.09 

2-3 

12/02/81 

63 

.047 

.10 

.020 

.04 

.060 

.09 

2-1 

12/11/81 

152 

.059 

.12 

.025 

, .05 

.054 

.08 

2-5 

12/23/81 

624 

.070 

.15 

.026 

.05 

.074 

.11 

2-6 

01/20/82 

'66 

.051 

.11 

.018 

.03 

.058 

.09 

2-7 

02/15/82 

70 

.139 

.29 

.037 

.07 

.146 

.22 

2-8 

03/17/82 

24 

1.867 

3.92 

.355 

.65 

1.429 

2.19 

2-9 

01/21/82 

53 

>3.00 

>6.30 

.718 

1.37 

>3.00 

>4.59 

2-10 

05/19/82 

95 

>3.00 

>6.30 

1.025 

1.88 

>3.00 

>4.59 

2-11 

OC/ 10/82 

37 

>3.00 

>6.30 

.917 

1.68 

>3.00 

>4.59 










3-1 

01/07/81 

13 

.090 

.19 

;049 

.09 

.138 

: . .21 

3-2 

05/12/81 

236 

.064 

.13 

.010 

.07 

;v> .094 

.14 

3-3 

05/30/81 

171 

.079 

.17 

.015 

.08 

.144 

.22 

3-1 

06/09/81 

315 

.211 

.11 

.085 

.16 

.275 

.42 

3-5 

07/06/81 

273 

1.707 

3.59 

.272 

.50 

1.773 

2.71 

3-6 

08/10/81 

158 

>3.00 

>6.30 

1.347 

2.47 

>3.00 

>4.59 

3-7 

09/08/81 

84 

>3.00 

>6.30 

2.291 

4.21 

>3.00 

>4.59 

3-8 

10/11/81 

180 

>3.00 

>6.30 

>3.00 

>5.50 

>3.00 

>4.59 

3-9 

11/11/81 

154 

»3.00 

>6.30 

>3.00 

>5.50 

>3.00 

»4.59 


IV.I.6.b. Samples from Random Blood Donors 

Samples from random blood donors (See Section iy.1.1.) were screened for HCV infection using the 
HCV dOO-3 ELISA, in which the antibody-enzyme conjugate was either an anti-IgG monoclonal conjugate, 
or a polyclonal conjugate. The total number of samples screened were 1077 and 1056, for the polyclonal 
conjugate and the monoclonal conjugate, respectively. A summary of the results of the screening is shown 
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in Table 15, and the sample distributions are shown in the histogram in Fig. 44. 

The calculation of the average and standard deviation was performed excluding samples that gave a 
signal over 1.5, i.e., 1073 OD values were used for the calculations utilizing the polyclonal conjugate, and 
1051 for the anti-IgG monoclonal conjugate. As seen in Table 15, when the polyclonal conjugate was used, 

5 the average was shifted from 0.0493 to 0.0931, and the standard deviation was increased from 0.074 to 
0.0933. Moreover, the results also show that if the criteria of x +5SD is employed to define the assay 
cutoff, the polyclonal-enzyme conjugate configuration in the ELISA requires a higher cutoff value. This 
indicates a reduced assay specificity as compared to the monoclonal system. In addition, as depicted in the 
histogram in Fig. 44, a greater separation of results between negative and positive distributions occurs when 

10 random blood donors are screened in an ELISA using the anti-IgG monoclonal conjugate as compared to 
the assay using a commercial polyclonal label. 

TABLE 15 


COMPARISON OF TWO ELISA CONFIGURATIONS IN TESTING SAMPLES FROM RANDOM BLOOD 


DONORS 


CONJUGATE 

POLYCLONAL (Jackson) 

ANTI-IgG MONOCLONAL 

Number of samples 

1073 

1051 

Average (x) 

0.0931 

0.04926 

Standard deviation (SD) 

0.0933 

0.07427 

5SD 

0.4666 

0.3714 

CUT-OFF (5 SD + x) 

0.5596 

0.4206 


I V.J. De tection of HCV Seroconversion in NANBH Patients from a Variety of Geographical Locations 

Sera from patients who were suspected to have NANBH based upon elevated ALT levels, and who were 
30 negative in HAV and HBV tests were screened using the RIA essentially as described in Section IV.D., 
except that the HCV C100-3 antigen was used as the screening antigen in the microtiter plates. As seen 
from the results presented in Table 16, the RIA detected positive samples in a high percentage of the 
cases. 

35 Table 16 


Seroconversion Frequencies for Anti-c100-3 Among NANBH Patients in Different Countries 

Country 

The Netherlands 

Italy 

Japan 

No. Examined 

5 

36 

26 

No. Positive 

3 

29 

19 

% Positive 

60 

80 

73 


45 

IV.K. Detection of HCV Seroconversion in Patients with "Community Acquired" NANBH 

Sera which was obtained from 100 patients with NANBH, for whom there was no obvious transmission 
route (i.e., no transfusions, i.v. drug use, promiscuity, etc. were identified as risk factors), was provided by 

so Dr. M. Alter of the Center for Disease Control, and Dr. J. Dienstag of Harvard University. These samples 
were screened using an RIA essentially as described in Section IV.D., except that the HCV c100-3 antigen 
was used as the screening antigen attached to the microtiter plates. The results showed that of the 100 
serum samples, 55 contained antibodies that reacted immunologically with the HCV c100-3 antigen. 

The results described above suggest that "Community Acquired" NANBH is also caused by HCV. 

55 Moreover, since it has been demonstrated herein that HCV is related to Flaviviruses, most of which are 
transmitted by arthropods, it is suggestive that HCV transmission in the "Community Acquired" cases also 
results from arthropod transmission. 
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IV.L Comparison of Incidence of HCV Antibodies and Surrogate Markers in Donors Implicated in NANBH i ; 
Transmission 

A prospective study was carried out to determine whether recipients of blood from suspected NANBH 

5 positive donors, who developed NANBH, seroconverted to anti-HCV-antibody positive. The blood donors 
were tested for the surrogate marker abnormalities which are currently used as markers for NANBH 
infection, i.e., elevated ALT levels, and the presence of anti-core antibody. In addition, the donors were also 
tested for the presence of anti-HCV antibodies. The determination of the presence of anti-HCV antibodies 
was determined using a radioimmunoassay as described in Section IV.K. The results of the study are 

10 presented in Table 17, which shows: the patient number (column 1); the presence of anti-HCV antibodies in 
patient serum (column 2); the number of donations received by the patient, with each donation being from a 
different donor (column 3); the presence of anti-HCV antibodies in donor serum (column 4); and the 
surrogate abnormality of the donor (column 5) (NT or -- means not tested) (ALT is elevated transaminase, 
and ANTI-HBc is anti-core antibody). 

75 The results in Table 17 demonstrate that the HCV antibody test is more accurate in detecting infected 
blood donors than are the surrogate marker tests. Nine out of ten patients who developed NANBH 
symptoms tested positive for anti-HCV antibody seroconversion. Of the 11 suspected donors, (patient 6 : 
received donations from two different individuals suspected of being NANBH carriers), 9 were positive for 
anti-HCV antibodies, and 1 was borderline positive, and therefore equivocal (donor for patient 1). In contrast, 

20 using the elevated ALT test 6 of the ten donors tested negative/ and using the anticore-antibody test 5 of 
the ten donors tested negative. Of greater consequence, though, in three. cases (donors to patients 8, 9, and 
10) the ALT test and the ANTI-HBc test yielded inconsistent results. 
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IV.M. Amplification for Cloning of HCV cDNA Sequences Utilizing the PCR and Primers Derived from 
Conserved Regions of Flavivirus Genomic Sequences 


50 


55 


The results presented supra., which suggest that HCV is a flavivirus or flavi-like virus, allows a strategy 
for cloning uncharacterized HCV cDNA sequences utilizing the PCR technique, and primers derived from 
the regions encoding conserved amino acid sequences in flaviviruses. Generally, one of the primers is 
derived from a defined HCV genomic sequence, and the other primer which flanks a region of unsequenced 
HCV polynucleotide is derived from a conserved region of the flavivirus genome. The flavivirus genomes 
are known to contain conserved sequences within the NS1 , and E polypeptides, which are encoded in the 
5'-region of the flavivirus genome. Corresponding sequences encoding these regions lie upstream of the 
HCV cDNA sequence shown in Fig. 26. Thus, to isolate cDNA sequences derived from this region of the 
HCV genome, upstream primers are designed which are derived from the conserved sequences within 
these flavivirus polypeptides. The downstream primers are derived from , an upstream end of the known 
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portion of the HCV cDNA. 

Because of the degeneracy of the code, it is probable that there will be mismatches between the 
flavivirus probes and the corresponding HCV genomic sequence. Therefore a strategy which is similar to 
the one described by Lee (1988) is used. The Lee procedure utilizes mixed oligonucleotide primers 

5 complementary to the reverse translation products of an amino acid sequence; the sequences in the mixed 
primers takes into account every codon degeneracy for the conserved amino acid sequence. 

Three sets of primer mixes are generated, based on the amino add homologies found in several 
flaviviruses, including Dengue-2,4 (D-2,4), Japanese Encephalitis Virus (JEV), Yellow Fever (YF), and West 
Nile Virus (WN). The primer mixture derived from the most upstream conserved sequence (5 f -1), is based 

to upon the amino acid sequence gly-trp-gly, which is part of the conserved sequence asp-arg-gly-trp-gly- 
aspN found in the E protein of D-2, JEV, YF, and WN. The next primer mixture <5'-2) is based upon a 
downstream conserved sequence in E protein, phe-asp-gly-asp-ser-tyr-ileu-phe-gly-asp-ser-tyr-ileu, and is 
derived from phe-gly-asp; the conserved sequence is present in D-2, JEV, YF, and WN. The third primer 
mixture (5'-3), is based on the amino acid sequence arg-ser-cys, which is part of the conserved sequence 

75 cys-cys-arg-ser-cys in the NS1 protein of D-2 t D-4, JEV, YF, and WN. The individual primers which form 
the mixture in 5'-3 are shown in Fig. 45. In addition to the varied sequences derived from conserved region, 
each primer in each mixture also contains a constant region at the 5'-end which contains a sequence 
encoding sites for restriction enzymes, Hindlll, Mbol, and EcoRI. 

The downstream primer, ssc5h20A, is derived from a nucleotide sequence in clone 5h, which contains 

20 HCV cDNA with sequences with overlap those in clones 14i and 11b. The sequence of ssc5h20A is 
5* GTA ATA TGG TGA CAG AGT CA 3\ 
An alternative primer, ssc5h34A, may also be used. This primer is derived from a sequence in clone 5h, 
and in addition contains nucleotides at the 5'-end which create a restriction enzyme site, thus facilitating 
cloning. The sequence of ssc5h34A is 

25 5' GAT CTC TAG AGA AAT CAA TAT GGT GAC AGA GTC A 3\ 

The PCR reaction, which was initially described by Saiki et al. (1986), is carried out essentially as 
described in Lee et al. (1988), except that the template for the cDNA is RNA isolated from HCV infected 
chimpanzee liver, as described in Section IV.C.2., or from viral particles isolated from HCV infected 
chimpanzee serum, as described in Section IV.A.1. In addition, the annealing conditions are less stringent in 

30 the first round of amplification (0.6M NaCI, and 25 * C), since the part of the primer which will anneal to the 
HCV sequence is only 9 nucleotides, and there could be mismatches. Moreover, if ssc5h34A is used, the 
additional sequences not derived from the HCV genome tend to destabilize the primer-template hybrid. 
After the first round of amplification, the annealing conditions can be more stringent (0.066M NaCI, and 
32 * C-37 • C), since the amplified sequences now contain regions which are complementary to, or duplicates 

35 of the primers. In addition, the first 10 cycles of amplification are run with Klenow enzyme I, under 
appropriate PCR conditions for that enzyme. After the completion of these cycles, the samples are 
extracted, and run with Taq polymerase, according to kit directions, as furnished by Cetus/Perkin-Elmer. 

After the amplification, the amplified HCV cDNA sequences are detected by hybridization using a probe 
derived from clone 5h. This probe is derived from sequences upstream of those used to derive the primer, 

40 and does not overlap the sequences of the clone 5h derived primers. The sequence of the probe is 

5 ' CCC AGC GGC GTA CGC GCT GGA CAC GGA GGT GGC CGC GTC 
45 GTG TGG CGG TGT TGT TCT CGT CGG GTT GAT GGC GC 3' . 


IV.N.1 . Creation of HCV cDNA Library from liver of a Chimpanzee with infectious NANBH 

50 

An HCV cDNA library was created from liver from the chimpanzee from which the HCV cDNA library in 
Section IV.A.1. was created. The technique for creating the library was similar to that in Section IV.A.24, 
except for this different source of the RNA, and that a primer based on the sequence of HCV cDNA in clone 
11b was used. The sequence of the primer was 
55 5* CTG GCT TGA AGA ATC 3\ 
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IV.N.2. Isolation and nucleotide sequence of overlapping HCV cDNA in clone k9-1 to cDNA in clone 11b 

Clone k9-1 was isolated from the HCV cDNA library created from the liver of an NANBH infected 
chimpanzee, as described in Section IV.A.25. The library was screened for clones which overlap the 

5 sequence in clone 11b, by using a clone which overlaps clone 11b at the S'-terminus, clone 11e. The 
sequence of clone 11b is shown in Fig. 23. Positive clones were isolated with a frequency of 1 in 500,000. 
One isolated clone, k9-1, was subjected to further study. The overlapping nature of the HCV cDNA in clone 
k9-1, to the 5'-end of the HCV-cDNA sequence in Fig. 26 was confirmed by probing the clone with clone 
Alex 46; this latter clone contains an HCV cDNA sequence of 30 base pairs which corresponds to those 

70 base pairs at the 5'terminus of the HCV cDNA in clone 14i, described supra.. 

The nucleotide sequence of the HCV cDNA isolated from clone kSM was determined using the 
techniques described supra. The sequence of the HCV cDNA in clone k9-1 , the overlap with the HCV cDNA 
in Fig. 26, and the amino acids encoded therein are shown in Fig. 46. 

The HCV cDNA sequence in clone k9-1 has been aligned with those of the clones described in Section 

75 IV.A.19 to create a composite HCV cDNA sequence, with the k9-1 sequence being placed upstream of the 
sequence shown in Fig. 32. The composite HCV cDNA which includes the k9-1 sequence and the amino 
acids encoded therein is shown in Fig. 47. 

The sequence of the amino acids encoded in the S'-region of HCV cDNA shown in Fig. 47 has been 
compared with the corresponding region of one of the strains of Dengue virus, described supra., with 

20 respect to the profile of regions of hydrophobicity and hydrophilicity. This comparison showed that the 
polypeptides from HCV and Dengue encoded in this region, which corresponds to the region encoding NS1 
(or a portion thereof), have a similar hydrophobic/hydrophilic profile. 

The information provided infra, allows the identification of HCV strains. The isolation and characteriza- 
tion of other HCV strains may be accomplished by isolating the nucleic acids from body components which 

25 contain viral particles, creating cDNA libraries using polynucleotide probes based on the HCV cDNA probes 
described infra., screening the libraries for clones containing HCV cDNA sequences described infra., and 
comparing the HCV cDNAs from the new isolates with the cDNAs described infra. The polypeptides 
encoded therein, or in the viral genome, may be monitored for immunological cross-reactivity utilising the 
polypeptides and antibodies described supra. Strains which fit within the parameters of HCV, as described 

30 in the Definitions section, supra., are readily identifiable. Other methods for identifying HCv strains will be 
obvious to those of skill in the art, based upon the information provided herein. 

Industrial Applicability 

35 The invention, in the various manifestations disclosed herein, has many industrial uses, some of which 
are the following. The HCV cDNAs may be used for the design of probes for the detection of HCV nucleic 
acids in samples. The probes derived from the cDNAs may be used to detect HCV nucleic acids in, for 
example, chemical synthetic reactions. They may also be used in screening programs for anti-viral agents, 
to determine the effect of the agents in inhibiting viral replication in cell culture systems, and animal model 

40 systems. The HCV polynucleotide probes are also useful in detecting viral nucleic acids in humans, and 
thus, may serve as a basis for diagnosis of HCV infections in humans. 

In addition to the above, the cDNAs provided herein provide information and a means for synthesizing 
polypeptides containing epitopes of HCV. These polypeptides are useful in detecting antibodies to HCV 
antigens. A series of immunoassays for HCV infection, based on recombinant polypeptides containing HCV 

4$ epitopes are described herein, and will find commercial use in diagnosing HCV induced NANBH, in 
screening blood bank donors for HCV-caused infectious hepatitis, and also for detecting contaminated blood 
from infectious blood donors. The viral antigens will also have utility in monitoring the efficacy of anti-viral 
agents in animal model systems. In addition, the polypeptides derived from the HCV cDNAs disclosed 
herein will have utility as vaccines for treatment of HCV infections. 

so The polypeptides derived from the HCV cDNAs, besides the above stated uses, are also useful for 
raising anti-HCV antibodies. Thus, they may be used in anti-HCV vaccines. However, the antibodies 
produced as a result of immunization with the HCV polypeptides are also useful in detecting the presence 
of viral antigens in samples. Thus, they may be used to assay the production of HCV polypeptides in 
chemical systems. The anti-HCV antibodies may also be used to monitor the efficacy of anti-viral agents in 

55 screening programs where these agents are tested in tissue culture systems. They may also be used for 
passive immunotherapy, and to diagnose HCV caused NANBH by allowing the detection of viral antigen(s) 
in both blood donors and recipients. Another important use for anti-HCV antibodies is in affinity chromatog- 
raphy for the purification of virus and viral polypeptides. The purified virus and viral polypeptide prepara- 
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tions may be used in vaccines. However, the purified virus may also be useful for the development of cell 
culture systems in which HCV replicates. 

Cell culture systems containing HCV infected cells will have many uses. They can be used for the 
relatively large scale production of HCV, which is normally a low titer virus/ These systems will also be 
5 useful for an elucidation of the molecular biology of the virus, and lead to the development of anti-viral 
agents. The cell culture systems will also be useful in screening for the efficacy of antiviral agents. In 
addition, HCV permissive cell culture systems are useful for the production of attenuated strains of HCV. 

For convenience, the anti-HCV antibodies and HCV polypeptides, whether natural or recombinant, may 
be packaged into kits. 

w The method used for isolating HCV cDNA, which is comprised of preapring a cDNA library derived from 
infected tissue of an individual, in an expression vector, and selecting clones which produce the expression 
products which react immunologically with antibodies in antibody-containing body components from other 
infected individuals and not from non-infected individuals, may also be applicable to the isolation of cDNAs 
derived from other heretofore uncharacterized disease-associated agents which are comprised of a genomic 

75 component. This, in turn, could lead to isolation and characterization of these agents, and to diagnostic 
reagents and vaccines for these other disease-associated agents. ■ 

Claims 

Claims for the following Contracting States : AT, BE, CH, DE, FR, GB, IT, LI, LU, NL, SE 

20 

1. A polypeptide in substantially isolated form comprising a contiguous sequence of at least 10 amino 
acids encoded by the genome of hepatitis C virus (HCV) and comprising an antigenic determinant, 
wherein HCV is characterized by: 

(i) a positive stranded RNA genome; 
25 (ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

2. A polypeptide according to claim 1 wherein said polyprotein comprises an amino acid sequence having 
30 at least 60% homology to the 859 amino acid sequence in Figure 14. 

3. A polypeptide according to claim 1 or 2 comprising at least 15 amino acids. 

4. A polypeptide according to any one of the preceding claims prepared by recombinant DNA expression. 

5. A polypeptide according to any one of claims 1 to 3 prepared by chemical synthesis. 

6. A polypeptide according to any one of claims 1 to 5 wherein said contiguous sequence is found in 
Figure 14. 

40 ; ; 

7. A polypeptide according to any one of claims 1 to 5 wherein said contiguous sequence is found in 
Figure 47. 

8. A polypeptide according to any one of claims 1 to 5 wherein said contiguous sequence is encoded 
45 within the Iambda-gt11 cDNA library deposited with the American type Culture Collection (ATCC) 

under accession no. 40394. 

9. A polypeptide according to any one of claims 1 to 8 wherein said contiguous sequence is from a 
nonstructural viral protein. 

so 

10. A polypeptide according to any one of claims 1 to 5, 7 or 8 wherein said contiguous sequence is from 
a structural viral protein. 

11. A polypeptide according to any one of claims. 1 to 8 whose sequence is as shown in any one of 
55 Figures 1, 3 to 32, 36, 46 and 47, or whose sequence is encoded in a polynucleotide selectively 

hybridisable with the polynucleotide as shown in any one of Figures 1, 3 to 32, 36, 46 or 47. 

12. A polypeptide according to any of claims 1 to 11 wherein the polypeptide is fixed to a solid phase. 
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13. An immunoassay kit comprising a polypeptide according to any one of claims 1 to 12 in a suitable 
container. 

14. A composition comprising a polypeptide in substantially isolated form according to any one of claims 1 
s to 1 1 mixed with a pharmaceutical ly acceptable excipient. 

15. A vaccine composition according to claim 14. 

16. An immunoassay for detecting antibody against hepatitis C virus (HCV) (anti-HCV antibody), which 
io immunoassay comprises: 

(a) providing a polypeptide comprising an antigenic determinant bindable by said anti-HCV antibody, 
wherein said antigenic determinant comprises a contiguous amino acid sequence encoded by the 
HCV genome and wherein HCV is characterized by: 

(i) a positive stranded RNA genome; 
75 (ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 
859 amino acid sequence in Figure 14, 

(b) incubating a biological sample with said polypeptide under conditions that allow for the formation 
of antibody-antigen complex; and 

(c) determining whether antibody-antigen complex comprising said polypeptide is formed. 

17. An immunoassay according to claim 16 wherein the polypeptide is prepared by recombinant DNA 
expression. 

18. An immunoassay according to claim 16 wherein the polypeptide is prepared by chemical synthesis. 

19. An immunoassay according to any one of claims 16 to 18 wherein said polypeptide is attached to a 
solid support. 

20. An immunoassay according to any one of claims 16 to 19 wherein said antibody-antigen complexes are 
detected by incubating the complexes with a labeled anti-human immunoglobulin antibody. 

21. An immunoassay of claim 20 wherein said anti-human immunoglobulin antibody is enzyme labeled. 

22. An immunoassay according to any one of claims 1 6 to 21 wherein said polyprotein comprises an amino 
acid sequence having at least 60% homology to the 859 amino acid sequence in Figure 14. 

23. An immunoassay according to any one of claims 16 to 22 wherein the contiguous sequence is at least 
10 amino acids. 

24. An immunoassay according to any one of claims 16 to 23 wherein the contiguous sequence is at least 
15 amino acids. 

25. An immunoassay according to any one of claims 16 to 24 wherein the contiguous sequence is found in 
45 Figure 14. 

26. An immunoassay according to any one of claims 16 to 24 wherein the contiguous sequence is found in 
Figure 47. 

so 27. An immunoassay according to any one of claims 16 to 24 wherein the contiguous sequence is as 
shown in any one of Figures 1, 3 to 32, 36, 46 or 47, or whose sequence is encoded in a 
polynucleotide selectively hybridisable with the polynucleotide as shown in any one of Figures 1 , 3 to 
32, 36, 46 or 47. 

55 28. An immunoassay according to any one of claims 16 to 27 wherein said contiguous sequence is 
encoded within the Iambda-gt11 cDNA library deposited with the American Type Culture Collection 
(ATCC) under accession no. 40394. 
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29. An immunoassay according to any one of claims 16 to 28 wherein said contiguous sequence rs from a 
nonstructural viral protein. 

30. An immunoassay according to any one of claims 16 to 24 or 26 to 28 wherein said contiguous 
5 sequence is from a structural viral protein. 

31. An immobilised polypeptide for use in the immunoassay of any one of claims 16 to 30 wherein the 
polypeptide comprises an antigenic determinant bindable by an anti-HCV antibody as defined in claim 
16. 

10 ' ;• ' ■ ; 

32. A polynucleotide in substantially isolated form comprising a contiguous sequence of nucleotides which 
is capable of selectively hybridizing to the genome of hepatitis C virus (HCV) or the compliment 
thereof, wherein. HCV is characterized by: 

(i) a positive stranded RNA genome; 
75 (ii) said genome comprising an open reading frame (ORF) encoding a poly protein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

33. A polynucleotide according to claim 32 wherein said polyprotein comprises an amino acid sequence 
20 having at least 60% homology to the 859 amino acid sequence in Figure 14. 

34. A polynucleotide according to claim 32 or 33 wherein said contiguous sequence is at least 10 
nucleotides. 

25 35. A polynucleotide according to claim 34 wherein said contiguous sequence is at least 15 nucleotides. 

36. A polynucleotide according to claim 35 wherein said contiguous sequence is at least 20 nucleotides. 

37. A polynucleotide according to any one of claims 32 to 36 which is a DNA polynucleotide. 

30 

38. A polynucleotide according to any one of claims 32 to 36 which is a RNA polynucleotide. 

39. A polynucleotide according to any one of claims 32 to 38 fixed to a solid phase. : 

35 40. A probe which comprises a polynucleotide according to any one of claims 32 to 39 further comprising a 
detectable label. 

41. An assay kit comprising a polynucleotide probe according to any one of claims 32 to 40 in a suitable 
container. 

40 

42. A polymerase chain reaction (PCR) kit comprising a pair of primers capable of priming the synthesis of 
cDNA in a PCR reaction, wherein each of said primers is a polynucleotide according to any one of 
claims 32 to 37. 

45 43. A PCR kit according to claim 42 further comprising a polynucleotide probe capable of selectively - 
hybridising to a region of the HCV genome between and not including the HCV sequences from which 
the primers are derived. 

44, A method of performing a polymerase chain reaction wherein the primers are a pair of polynucleotides 
so according to any of claims 32 to 37. 

45. A method for assaying a sample for the presence or absence of HCV polynucleotides comprising: 

(a) contacting the sample with a probe comprising a polynucleotide according to any one of claims 
32 to 40 under conditions that allow the selective hybridisation of said probe to an HCV poly- 

55 nucleotide or the compliment thereof in the sample; and 

(b) determining whether polynucleotide duplexes comprising said probe are formed. 
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46. A DNA polynucleotide encoding a polypeptide, which polypeptide comprises a contiguous sequence of 
at least 10 amino acids encoded by the genome of hepatitis C virus (HCV) and comprising an antigenic 
determinant, wherein HCV is characterized by: 

(i) a positive stranded RNA genome; 
s (ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

47. A DNA nucleotide according to claim 46 wherein said polyprotein comprises an amino acid sequence 
io having at least 60% homology to the 859 amino acid sequence in Figure 14. 

48. A DNA polynucleotide according to claim 46 or 47 wherein said contiguous sequence encodes at least 
1 5 amino acids. 

75 49. A DNA polynucleotide according to any one of claims 46 to 48 wherein said contiguous sequence is 
found in Figure 14. 

50. A DNA polynucleotide according to any one of claims 46 to 48 wherein said contiguous sequence is 
found in Figure 47. 

20 

51. A DNA polynucleotide according to any one of claims 46 to 48 wherein said contiguous sequence is 
encoded within the Iambda-gt11 cDNA library deposited with the American Type Culture Collection 
(ATCC) under accession no. 40394. 

25 52. A DNA polynucleotide according to any one of claims 46 to 48 whose sequence is as shown in any one 
of Figures 1, 3 to 32, 36, 46 or 47, or whose sequence is selectively hybridisable with the 
polynucleotide as shown in any one of Figures 1 , 3 to 32, 36, 46 or 47. 

53. A DNA polynucleotide according to any one of claims 46 to 52 wherein said contiguous sequence is 
30 from a nonstructural viral protein. 

54. A DNA polynucleotide according to any one of claims 46 to 48 or 50 to 52 wherein said contiguous 
sequence is from a structural viral protein. 

35 55. A recombinant vector comprising a coding sequence which comprises a DNA polynucleotide according 
to any one of claims 46 to 54. 

56. A host cell transformed by a recombinant vector according to claim 55 wherein the coding sequence is 
operably linked to a control sequence capable of providing for the expression of the coding sequence 

40 by the host cell. 

57. A method of producing a recombinant HCV polypeptide comprising incubating a host cell according to 
claim 56 under conditions that provide for the expression of the coding sequence. 

45 58. An anti-HCV antibody composition comprising antibodies that bind said antigenic determinant of a 
polypeptide according to any one of claims 1 to 12 which is (a) a purified preparation of polyclonal 
antibodies, or (b) a monoclonal antibody composition. 


50 


59. A composition according to claim 58 wherein the anti-HCV antibodies are fixed to a solid phase. 

60. An immunoassay kit comprising an anti-HCV antibody composition according to claim 58 or 59 in a 
suitable container. 


61. An immunoassay method for detecting an HCV antigen in a sample comprising: 
55 (a) providing an anti-HCV antibody composition according to claim 58 or 59; 

(b) incubating a sample with said anti-HCV antibody composition under conditions that allow for the 
formation of an antibody-antigen complex; and 

(c) determining whether antibody-antigen complex comprising the anti-HCV antibody is formed. 


75 
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62. A polypeptide comprising a contiguous sequence of at least 10 amino acids encoded by the genome of 
hepatitis C virus (HCV) and comprising an antigenic determinant, wherein said contiguous sequence is 
fused to a non-HCV amino acid sequence, and wherein HCV is characterized by: 

(i) a positive stranded RNA genome; 
5 (ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

63. A polypeptide according to claim 62 wherein said polyprotein comprises an amino acid sequence 
70 having at least 60% homology to the 859 amino acid sequence in Figure 14. 

64. A polypeptide according to claim 62 or 63 wherein said non-HCV amino acid sequence comprises a 
signal sequence. 

75 65. A polypeptide according to claim 62 or 63 wherein said non-HCV amino acid sequence comprises an 
amino acid sequence from beta-galactosidase or superoxide dismutase. 

66. A polypeptide according to claim 62 or 63 wherein the non-HCV amino acid sequence comprises a 
particle-forming protein. 

20 

67. A polypeptide according to claim 66 wherein the particle-forming protein comprises hepatitis B surface 
antigen. 

68. A polypeptide according to any one of claims 1 to 12 or 62 to 67 for use in a method of making anti- 
25 HCV antibodies which comprises administering the polypeptide to a mammal in an amount sufficient to 

produce an immune response. 

69. A composition comprising a polypeptide according to any one of claims 62 to 67 mixed with a 
pharmaceutical^ acceptable excipient. 


30 


70. A vaccine according to claim 69. 


71. A method of growing hepatitis C virus (HCV) comprising providing cells infected with HCV, and 
propagating said ceils in vitro, wherein said HCV is characterized by: 
35 (i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

40 72. A method according to claim 71 wherein said polyprotein comprises an amino acid sequence having at 
least 60% homology to the 859 amino acid sequence in Figure 14. 

73. A method according to claim 71 or 72 wherein said cells comprise primary cells. 

45 74. A method according to claim 71 or 72 wherein said cells comprise a cell line. 

75. A method according to any one of claims 71 to 74 wherein said cells are hepatocytes or macrophages. 

76. A hepatitis C virus (HCV) immunoassay antigen fixed to a solid phase, wherein HCV is characterized 
so by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14; 

55 said antigen comprising an antigenic determinant immunologically reactive with an anti-HCV antibody, 
wherein 

(a) said anti-HCV antibody is immunologically reactive with a reference antigenic determinant (i) 
encoded by an HCV cDNA insert in the Iambda-gt11 library deposited with the American Type 
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Culture Collection (ATCC) under accession no. 40394, or (ii) found in Figure 47; and 

(b) said reference antigenic determinant is immunologically reactive with sera from HCV-infected 

humans. 

5 77. An HCV antigen according to claim .76 wherein said reference antigenic determinant is found in Figure 
14. ' 

Claims for the following Contracting State : ES 

io 1. An immunoassay for detecting antibody against hepatitis C virus (HCV) (antirHCV antibody), which 
immunoassay comprises: 

(a) providing a polypeptide comprising an antigenic determinant bindable by said anti-HCV antibody, 
wherein said antigenic determinant comprises a contiguous amino acid sequence encoded by the 
HCV genome and wherein HCV is characterized by: 

75 (i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (OR F) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence haying at least 40% homology to the 
859 amino acid sequence in Figure 14, 

(b) incubating a biological sample with said polypeptide under conditions that allow for the formation 
20 of antibody-antigen complex; and 

■ (c) determining whether antibody-antigen complex comprising said polypeptide is formed. : 

2. An immunoassay according to claim 1 wherein the polypeptide is prepared by recombinant DNA 
expression. 

25 

3. An immunoassay according to claim 1 wherein the polypeptide is prepared by chemical synthesis. 

4. An immunoassay according to any one of claims 1 to 3 wherein said polypeptide is attached to a solid 
support. 

5. An immunoassay according to any one of claims 1 to 4 wherein said antibody-antigen complexes are 
detected by incubating the complexes with a labeled anti-human immunoglobulin antibody, 

6. An immunoassay of claim 5 wherein said anti-human immunoglobulin antibody is enzyme labeled. 

7. An immunoassay according to any one of claims 1 to 6 wherein said polyprotein comprises an amino 
acid sequence having at least 60% homology to the 859 amino acid sequence in Figure 14. 

8. An immunoassay according to any one of claims 1 to 7 wherein the contiguous sequence is at least 10 
40 amino acids. 

9. An immunoassay according to any one of claims 1 to 8 wherein the contiguous sequence is at least 15 
amino acids. 

45 10. An immunoassay according to any one of claims 1 to 9 wherein the contiguous sequence is found in 
Figure 14. 

11. An immunoassay according to any one of claims 1 to 9 wherein the contiguous sequence is found in 
Figure 47. 

50 

12. An immunoassay according to any one of claims 1 to 9 wherein the contiguous sequence is as shown 
in any one of Figures 1, 3 to 32, 36, 46 or 47, or whose sequence is encoded in a polynucleotide 
selectively hybridisable with the polynucleotide as shown in any one of Figures 1 , 3 to 32, 36, 46 or 47. 

55 13. An immunoassay according to any one of claims 1 to 12 wherein, said contiguous sequence is encoded 
within the Iambda-gt1 1 cDNA library deposited with the American Type Culture Collection (ATCC) 
under accession no. 40394. 
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14. An immunoassay according to any one of claims 1 to 13 wherein said contiguous sequence is from a 
nonstructural viral protein. 

15. An immunoassay according to any one of claims 1 to 9 or 11 to 13 wherein said contiguous sequence 
5 is from a structural viral protein. 

16. A method of performing a polymerase chain reaction wherein the primers are a pair of polynucleotides, 
which polynucleotides are each a contiguous sequence of nucleotides which is capable of selectively 
hybridizing to the genome of hepatitis C virus (HCV) or the compliment thereof, wherein HCV is 

10 characterized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

75 

17. A method according to claim 16 wherein said polyprotein comprises an amino acid sequence having at 
least 60% homology to the 859 amino acid sequence in Figure 14. 

18. A method according to claim 16 or 17 wherein said contiguous polynucleotide sequence is at least 10 
20 nucleotides. 

19. A method according to claim 18 wherein said contiguous polynucleotide sequence is at least 15 
nucleotides. 

25 20. A method according to claim 19 wherein said contiguous polynucleotide sequence is at least 20 
nucleotides. 

21. A method according to any one of claims 16 to 20 wherein the polynucleotide is a DNA polynucleotide. 

30 22. A method according to any one of claims 16 to 21 which further comprises the use of a polynucleotide 
probe capable of selectively hybridising to a region of the HCV genome between and not including the 
HCV sequences from which the primers are derived. 

23. A method for assaying a sample for the presence or absence of HCV polynucleotides comprising: 
35 (a) contacting the sample with a probe comprising a polynucleotide as defined in any one of claims 

16 to 21 under conditions that allow the selective hybridisation of said probe to an HCV poly- 
nucleotide or the compliment thereof in the sample; and 

(b) determining whether polynucleotide duplexes comprising said probe are formed. 
40 24. A method according to claim 23 wherein the probe is a RNA polynucleotide. 

25. A method according to claim 23 or 24 wherein the probe is fixed to a solid phase. 

26. A method according to any one of claims 23 to 25 wherein the probe further comprises a detectable 
45 label. 

27. A method of producing a recombinant HCV polypeptide comprising incubating a host cell transformed 
by a recombinant vector comprising a coding sequence of a DNA polynucleotide encoding a polypep- 
tide, which polypeptide comprises a contiguous sequence of at least 10 amino acids encoded by the 

50 genome of hepatitis C virus (HCV) and comprising an antigenic determinant, wherein HCV is character- 
ized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
55 amino acid sequence in Figure 14, 

wherein the coding sequence is operably linked to a control sequence capable of providing for the 
expression of the coding sequence by the host cell under conditions that provide for the expression of 
the coding sequence. 
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28. A method according to claim 27 wherein said polyprotein comprises an amino acid sequence having at 
least 60% homology to the 859 amino acid sequence in Figure 14. 

29. A method according to claim 27 or 28 wherein said contiguous sequence encodes at least 15 amino 
5 acids. 

30. A method according to any one of claims 27 to 29 wherein said contiguous sequence is found in Figure 
14. 

70 31. A method according to any one of claims 27 to 29 wherein said contiguous sequence is found in Figure 
47. 

32. A method according to any one of claims 27 to 29 wherein said contiguous sequence is encoded within 
the Iambda-gt11 cDNA library deposited with the American Type Culture Collection (ATCC) under 

75 accession no. 40394. 

33. A method according to any one of claims 27 to 29 wherein the DNA polynucleotide sequence is as 
shown in any one of Figures 1, 3 to 32, 36, 46 or 47, or whose sequence is selectively hybridisable 
with the polynucleotide as shown in any one of Figures 1 , 3 to 32, 36, 46 or 47. 

20 

34. A method according to any one of claims 27 to 33 wherein said contiguous sequence is from a 
nonstructural viral protein. 

35. A method according to any one of claims 27 to 29 or 31 to 33 wherein said contiguous sequence is 
25 from a structural viral protein. 

36. A method comprising a mixing polypeptide in substantially isolated form prepared by the. method of 
any one of claims 26 to 35 or a polypeptide of the same sequence prepared by chemical synthesis 
with a pharmaceutical^ acceptable excipient. 

30 

37. A method according to claim 36 for the preparation of a vaccine. 

38. A method according to claim 27 or 28 wherein said contiguous sequence is fused to a non-HCV amino 
acid sequence. 

35 

39. A method according to claim 38 wherein said non-HCV amino acid sequence comprises a signal 
sequence. 

40. A method according to claim 38 wherein said non-HCV amino acid sequence comprises an amino acid 
40 sequence from beta-galactosidase or superoxide dismutase. 

41. A method according to claim 38 wherein the non-HCV amino acid sequence comprises a particle- 
forming protein. 

45 42. A method according to claim 41 wherein the particle-forming protein comprises hepatitis B surface 
antigen. 

43. A method comprising mixing a polypeptide prepared by the method of any one of claims 38 to 42 with 
a pharmaceutically acceptable excipient. 

50 

44. A method according to claim 43 for the preparation of a vaccine. 

45. An immunoassay method for detecting an HCV antigen in a sample comprising: 

(a) providing an anti-HCV antibody composition comprising antibodies that bind said antigenic 
55 determinant of a polypeptide as defined in any one of claims 1 to 4 or 7 to 15 which is (a) a purified 

preparation of polyclonal antibodies, or (b) a monoclonal antibody composition; 

(b) incubating a sample with said anti-HCV antibody composition under conditions that allow for the 
formation of an antibody-antigen complex; and 
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(c) determining whether antibody-antigen complex comprising the anti-HCV antibody is formed. 

46. A method according to claim 45 wherein the anti-HCV antibodies are fixed to a solid phase. 

5 47. A method of growing hepatitis C virus (HCV) comprising providing cells infected with HCV, and 
propagating said cells in vitro, wherein said HCV is characterized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
70 amino acid sequence in Figure 14. 

48. A method according to claim 47 wherein said polyprotein comprises an amino acid sequence having at 
least 60% homology to the 859 amino acid sequence in Figure 14. 

75 49. A method according to claim 47 or 48 wherein said cells comprise primary cells. 

50. A method according to claim 47 or 48 wherein said cells comprise a cell line. 

51. A method according to any one of claims 47 to 50 wherein said cells are hepatocytes or macrophages. 
20 ' ; • . 

52. A method of preparing a hepatitis C virus (HCV) immunoassay antigen, wherein HCV is characterized 
by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

25 (iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 

amino acid sequence in Figure 14; 
said antigen comprising an antigenic determinant immunologically reactive with an anti-HCV antibody, 
wherein 

(a) said anti-HCV antibody is immunologically reactive with a reference antigenic determinant (i) 
30 encoded by an HCV cDNA insert in the Iambda-gt11 library deposited with the American Type 

Culture Collection (ATCC) under accession no: 40394, or (ii) found in Figure 47; and 

(b) said reference antigenic determinant is immunologically reactive with sera from HCV-infected 
humans; 

which comprises fixing said immunoassay antigen to a solid phase. 

35 

53. A method according to claim 52 wherein said reference antigenic determinant is found in Figure 14. 

Claims for the following Contracting State : GR 

40 1. An immunoassay for detecting antibody against hepatitis C virus (HCV) (anti-HCV antibody), which 
immunoassay comprises: 

(a) providing a polypeptide comprising an antigenic determinant bindabie by said anti-HCV antibody, . ; 
wherein said antigenic determinant comprises a contiguous amino acid sequence encoded by the 
HCV genome and wherein HCV is characterized by: 

45 (i) a positive stranded RNA genome; . 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 
859 amino acid sequence in Figure 14, 

(b) incubating a biological sample with said polypeptide under conditions that allow for the formation 
so of antibody-antigen complex; and 

(c) determining whether antibody-antigen complex comprising said polypeptide is formed. 

2. An immunoassay according to claim 1 : wherein the polypeptide is prepared by recombinant DNA 
expression. 

55 

3. An immunoassay according to claim 1 wherein the polypeptide is prepared by chemical synthesis. 
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4. An immunoassay according to any one of claims 1 to 3 wherein said polypeptide is attached to a solid 
. support. 

5. An immunoassay according to any one of claims 1 to 4 wherein said antibody-antigen complexes are 
5 detected by incubating the complexes with a labeled anti-human immunoglobulin antibody. 

6. An immunoassay of claim 5 wherein said anti-human immunoglobulin antibody is enzyme labeled. 

7. An immunoassay according to any one of claims 1 to 6 wherein said polyprotein comprises an amino 
70 acid sequence having at least 60% homology to the 859 amino acid sequence in Figure 14. 

8.. An immunoassay according to any one of claims 1 to 7 wherein the contiguous sequence is at least 10 
amino acids. 

75 9. An immunoassay according to any one of claims 1 to 8 wherein the contiguous sequence is at least 15 
amino acids. 

10. An immunoassay according to any one of claims 1 to 9 wherein the contiguous sequence is found in 
Figure 14. 

20 

11. An immunoassay according to any one of claims 1 to 9 wherein the contiguous sequence is found in 
Figure 47. 

12. An immunoassay according to any one of claims 1 to 9 wherein the contiguous sequence is as shown 
25 in any one of Figures 1, 3 to 32, 36, 46 or 47, or whose sequence is encoded in a polynucleotide 

selectively hybridisable with the polynucleotide as shown in any one of Figures 1, 3 to 32, 36, 46 or 47. 

13. An immunoassay according to any one of claims 1 to 12 wherein said contiguous sequence is encoded 
within the Iambda-gt11 cDNA library deposited with the American: Type Culture Collection (ATCC) 

30 under accession no. 40394. 

14. An immunoassay according to any one of claims 1 to 13 wherein said contiguous sequence is from a 
nonstructural viral protein. 

35 15. An immunoassay according to any one of claims 1 to 9 or 11 to 13 wherein said contiguous sequence 
is from a structural viral protein. 

16. A method of performing a polymerase chain reaction wherejn the primers are a pair of polynucleotides, 
which polynucleotides are each a contiguous sequence of nucleotides which is capable of selectively 

40 hybridizing to the genome of hepatitis C virus (HCV) or the compliment thereof, wherein HCV is 
characterized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
45 amino acid sequence in Figure 14. 

17. A method according to claim 16 wherein said polyprotein comprises an amino acid sequence having at 
least 60% homology to the 859 amino acid sequence in Figure 14, 

50 18. A method according to claim 16 or . 17 wherein said contiguous polynucleotide sequence is at least 10 
nucleotides. 

19. A method according to claim 18 wherein .said contiguous polynucleotide sequence is at least 15 
nucleotides. 

55 

20. A method according to claim 19 wherein said contiguous polynucleotide sequence is at least 20 
nucleotides. / . , 
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21. A method according to any one of claims 16 to 20 wherein the polynucleotide is a DNA polynucleotide. 

22. A method according to any one of claims 16 to 21 which further comprises the use of a polynucleotide 
probe capable of selectively hybridising to a region of the HCV genome between and not including the 

5 HCV sequences from which the primers are derived. 

A method for assaying a sample for the presence or absence of HCV polynucleotides comprising: 

(a) contacting the sample with a probe comprising a polynucleotide as defined in any one of claims 
16 to 21 under conditions that allow the selective hybridisation of said probe to an HCV poly- 
nucleotide or the compliment thereof in the sample; and 

(b) determining whether polynucleotide duplexes comprising said probe are formed. 
24. A method according to claim 23 wherein the probe is a RNA polynucleotide. 

75 25. A method according to claim 23 or 24 wherein the probe is fixed to a solid phase. 

26. A method according to any one of claims 23 to 25 wherein the probe further comprises a detectable 
label. 

A method of producing a recombinant HCV polypeptide comprising incubating a host cell transformed 
by a recombinant vector comprising a coding sequence of a DNA polynucleotide encoding a polypep- 
tide, which polypeptide comprises a contiguous sequence of at least 10 amino acids encoded by the 
genome of hepatitis C virus (HCV) and comprising an antigenic determinant, wherein HCV is character- 
ized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14, 

wherein the coding sequence is operably linked to a control sequence capable of providing for the 
expression of the coding sequence by the host cell under conditions that provide for the expression of 
the coding sequence. 

A method according to claim 27 wherein said polyprotein comprises an amino acid sequence having at 
least 60% homology to the 859 amino acid sequence in Figure 14. 

A method according to claim 27 or 28 wherein said contiguous sequence encodes at least 15 amino 
acids. 

30. A method according to any one of claims 27 to 29 wherein said contiguous sequence is found in Figure 
40 14. 

31. A method according to any one of claims 27 to 29 wherein said contiguous sequence is found in Figure 
47. 

45 32. A method according to any one of claims 27 to 29 wherein said contiguous sequence is encoded within 
the Iambda-gt11 cDNA library deposited with the American Type Culture Collection (ATCC) under 
accession no. 40394. 

33. A method according to any one of claims 27 to 29 wherein the DNA polynucleotide sequence is as 
so shown in any one of Figures 1 , 3 to 32, 36, 46 or 47, or whose sequence is selectively hybridisable 

with the polynucleotide as shown in any one of Figures 1 , 3 to 32, 36, 46 or 47. 

34. A method according to any one of claims 27 to 33 wherein said contiguous sequence is from a 
nonstructural viral protein. 

55 

35. A method according to any one of claims 27 to 29 or 31 to 33 wherein said contiguous sequence is 
from a structural viral protein. 
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36. A method comprising a mixing polypeptide in substantially isolated form prepared by the method of 
any one of claims 26 to 35 or a polypeptide of the same sequence prepared by chemical synthesis 
with a pharmaceutical^ acceptable excipient. 

5 37. A method according to claim 36 for the preparation of a vaccine. 

38. A method according to claim 27 or 28 wherein said contiguous sequence is fused to a non-HCV amino 
acid sequence. 

w 39. A method according to claim 38 wherein said non-HCV amino acid sequence comprises a signal 
sequence. 

40. A method according to claim 38 wherein said non-HCV amino acid sequence comprises an amino acid 
sequence from beta-gal actosidase or superoxide dismuiase. 

75 ' ■ " : . • ' •' '• ■'■ ,' : " / ■ 

41. A method according to claim 38 wherein the non-HCV amino acid sequence comprises a particle- 
forming protein. 

42. A method according to claim 41 wherein the particle-forming protein comprises hepatitis B surface 
20 antigen. 

43. A method comprising mixing a polypeptide prepared by the method of any one of claims 38 to 42 with 
a pharmaceutical^ acceptable excipient. 

25 44. A method according to claim 43 for the preparation of a vaccine. 

45. An immunoassay method for detecting an HCV antigen in a sample comprising: 

(a) providing an anti-HCV antibody composition comprising antibodies that bind said antigenic 
determinant of a polypeptide as defined in any one of claims 1 to 4 or 7 to 15 which is (a) a purified 

30 preparation of polyclonal antibodies* or (b) a monoclonal antibody composition; 

(b) incubating a sample with said anti-HCV antibody composition under conditions that allow for the 
formation of an antibody-antigen complex; and 

(c) determining whether antibody-antigen complex comprising the anti-HCV antibody is formed. 
35 46. A method according to claim 45 wherein the anti-HCV antibodies are fixed to a solid phase. ; 

47. A method of growing hepatitis C virus (HCV) comprising providing cells infected with HCV, and 
propagating said cells in vitro, wherein said HCV is characterized by: 

(i) a positive stranded RNA genome; 

40 (ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iiij said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

48. A method according to claim 47 wherein said polyprotein comprises an amino acid sequence having at 
45 least 60% homology to the 859 amino acid sequence in Figure 14. 

49. A method according to claim 47 or 48 wherein said cells comprise primary cells. 

50. A method according to claim 47 or 48 wherein said cells comprise a cell line. 

50 

51. A method according to any one of claims 47 to 50 wherein said cells are hepatocytes or macrophages. 

52. A method of preparing a hepatitis C virus (HCV) immunoassay antigen, wherein HCV is characterized 
by: 

55 (i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14; 
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said antigen comprising an antigenic determinant immunologically reactive with an anti-HCV antibody, 
wherein 

(a) said anti-HCV antibody is immunologically reactive with a reference antigenic determinant (i) 
encoded by an HCV cDNA insert in the Iambda-gt11 library deposited with the American Type 
Culture Collection (ATCC) under accession no. 40394, or (ii) found in Figure 47; and 

(b) said reference antigenic determinant is immunologically reactive with sera from HCV-infected 
humans; 

which comprises fixing said immunoassay antigen to a solid phase. 

53. A method according to claim 52 wherein said reference antigenic determinant is found in Figure 14. 

54. An immobilised polypeptide for use in the immunoassay of any one of claims 1 to 15 wherein the. 
polypeptide comprises an antigenic determinant bindable by an anti-HCV antibody as defined in claim 
1. 

55. A polypeptide in substantially isolated form comprising a contiguous sequence of at least 10 amino 
acids encoded by the genome of hepatitis C virus (HCV) and comprising an antigenic determinant, 
wherein HCV is characterized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

56. A polypeptide according to claim 55 which is as defined in any one of claims 2 to 4 or 7 to 15. 

57. A polynucleotide in substantially isolated form comprising a contiguous sequence of nucleotides which 
is capable of selectively hybridizing to the genome of hepatitis C virus (HCV) or the compliment 
thereof, wherein HCV is characterized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

58. A polynucleotide according to claim 57 which is as defined in any one of claims 17 to 21 or 24 to 26. 

59. An assay kit comprising a polynucleotide probe according to claim 58 in a suitable container. 

60. A polymerase claim reaction kit comprising a pair of primers capable of priming the synthesis of cDNA 
in a PCR reaction, wherein each of said primers is a polynucleotide as defined in any one of claims 16 
to 21. 

61. A PCR kit according to claim 60 further comprising a polynucleotide probe capable of selectively 
hybridising to a region of the HCV genome between and not including the HCV sequences from which 
the primers are derived. 

62. A DNA polynucleotide encoding a polypeptide, which polypeptide comprises a contiguous sequence of 
at least 10 amino acids encoded by the genome of hepatitis C virus (HCV) and comprising an antigenic 
determinant, wherein HCV is characterized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14. 

63. A DNA polynucleotide according to claim 62 which is as defined in any one of claims 28 to 35. 

64. A recombinant vector comprising a coding sequence which comprises a DNA polynucleotide according 
to any one of claims 62 or 63. 
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65. A host cell transformed by a recombinant vector according to claim 64 wherein the coding sequence is 
operably linked to a control sequence capable of providing for the expression of the coding sequence 
by the host cell. 

5 66. An anti-HCV antibody composition comprising antibodies that bind said antigenic determinant of a 
polypeptide according to any one of claims 55 or 56 which is (a) a purified preparation of polyclonal 
antibodies, or (b) a monoclonal antibody composition. 

67. A composition according to claim 66 wherein the anti-HCV antibodies are fixed to a solid phase. 

10 

68. An immunoassay kit comprising an anti-HCV antibody composition according to claim 66 or 67 in a 
suitable container. 

69. A polypeptide comprising a contiguous sequence of at least 10 amino acids encoded by the genome of 
is hepatitis C virus (HCV) and comprising an antigenic determinant, wherein said contiguous sequence is 

fused to a non-HCV amino acid sequence, and wherein HCV is characterized by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
20 amino acid sequence in Figure 14. 

70. A polypeptide according to claim 69 which is as defined in any one of claims 39 to 42. 

71. A hepatitis C virus (HCV) immunoassay antigen fixed to a solid phase, wherein HCV is characterized 
25 by: 

(i) a positive stranded RNA genome; 

(ii) said genome comprising an open reading frame (ORF) encoding a polyprotein; and 

(iii) said polyprotein comprising an amino acid sequence having at least 40% homology to the 859 
amino acid sequence in Figure 14; 

30 said antigen comprising an antigenic determinant immunologically reactive with an anti-HCV antibody, 
wherein 

(a) said anti-HCV antibody is immunologically reactive with a reference antigenic determinant (i) 
encoded by an HCV cDNA insert in the Iambda-gt11 library deposited with the American Type 
Culture Collection (ATCC) under accession no. 40394, or (ii) found in Figure 47; and 
35 (b) said reference antigenic determinant is immunologically reactive with sera from HCV-infected 

humans. 

72. An HCV antigen according to claim 71 wherein said reference antigenic determinant is found in Figure 
14. 

40 

Patentanspruche 

Patentanspruche fur folgende Vertragsstaaten : AT, BE, CH, DE, FR, GB, IT, LI, LU, NL, SE 

1- Polypeptid in im wesentlichen isolierter Form, umfassend eine Sequenz von mindestens 10 aufeinan- 
45 derfolgenden Aminosauren, die von dem Genom des Hepatitis C- Virus (HCV) codiert wird und eine 
antigene Determinante umfaSt, wobei HCV charakterisiert ist durch: 

(i) einen positiven Strang des RNA-Genoms; 

(ii) das Genom umfaGt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine AminosSuresequenz, die mindestens 40 % Homologie mit der 859 
50 Aminosauren umfassenden Sequenz in Figur 14 aufweist. 

2. Polypeptid nach Anspruch 1 . wobei das Polyprotein eine Aminos&uresequenz umfaSt, die mindestens 
60 % Homologie mit der 859 AminosSuren umfassenden Sequenz in Fig. 14 aufweist. 

55 3. Polypeptid nach Anspruch 1 Oder 2, umfassend mindestens 15 Aminosauren, 

4. Polypeptid nach einem der vorangehenden Anspruche, hergestellt durch Expression rekombinanter 
DNA. 
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5. Polypeptid nach einem der AnsprGche 1 bis 3, hergestetlt durch chemische Synthese. 

6. Polypeptid nach einem der AnsprGche 1 bis 5 t wobei die aufeinanderfolgende Sequenz in Fig. 14 
gezeigt ist. 

5 

7. Polypeptid nach einem der Anspruche 1 bis 5 t wobei die aufeinanderfolgende Sequenz in Fig. 47 
gezeigt ist. 

8. Polypeptid nach einem der AnsprGche 1 bis 5, wobei die aufeinanderfolgende Sequenz innerhalb der 
io Iambda-gt1 1 -cDNA-Genbank codiert ist, die bei der American Type Culture Collection (ATCC) unter der 

Hinterlegungs-Nummer 40394 hinterlegt ist. 

9. Polypeptid nach einem der AnsprGche 1 bis 8, wobei die aufeinanderfolgende Sequenz von einem 
nichtstrukturellen viralen Protein stammt. 

75 

10. Polypeptid nach einem der AnsprGche 1 bis 5, 7 oder 8, wobei die aufeinanderfolgende Sequenz von 
einem strukturellen viralen Protein stammt. 

11. Polypeptid nach einem der AnsprGche 1 bis 8, dessen Sequenz in einer der Figuren 1, 3 bis 32, 36, 46 
20 und 47 gezeigt ist, Oder dessen Sequenz von einem Polynucleotid codiert wird, das selektiv mit dem 

Polynucleotid gemaB einer der Figuren 1, 3 bis 32, 36, 46 oder 47 hybridisierbar ist. 

12. Polypeptid nach einem der Anspruche 1 bis 1 1, wobei das Polypeptid an eine feste Phase fixiert jst. 

25 13. Immunoassay-Kit, umfassend ein Polypeptid nach einem der AnsprGche 1 bis 12 in einem geeigneten 
Behalter. 

14. Zusammensetzung, umfassend ein Polypeptid in im wesentlichen isolierter Form nach einem der 
AnsprGche 1 bis 11 im Gem isch mit einem pharmazeutisch vertrSglichen Excipienten. 


30 


15. Impfstoffzusammensetzung nach Anspruch 14. 


16. Immunoassay zum Nachweis eines Antikorpers gegen Hepatitis C-Virus: (HCV) (anti-HCV-Antikorper), 
wobei der Immunoassay folgende Schritte umfaBt: 

35 (a) Bereitstellung eines Polypeptids, umfassend eine antigene Determinante, an die der anti-HCVv 

AntikSrper bindet, wobei die antigene Determinante eine aufeinanderfolgende Aminosauresequenz 
umfaBt, die von dem HCV-Genom codiert wird, und wobei HCV folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

40 (iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der. 

859 AminosSuren umfassenden Sequenz in Fig. 14 aufweist, 

(b) Inkubation einer biologischen Probe mit dem Polypeptid unter Bedingungen, die die Ausbildung 
des Antikorper-Antigen-Komplexes erlauben; und 

(c) Bestimmung, ob der das Polypeptid umfassende Antik5rper-Antigen^Komplex ausgebildet wurde. 

17. Immunoassay nach Anspruch 16, wobei das Polypeptid durch Expression rekombinanter DNA herge- 
stellt wurde. 

1a Immunoassay nach Anspruch 16, wobei das Polypeptid durch chemische Synthese hergestellt wurde. : 

50 

19. Immunoassay nach einem der AnsprGche 16 bis 18, wobei das Polypeptid an einen festen TrSger 
geknGpft ist. 

20. Immunoassay nach einem der AnsprGche 16 bis 19, wobei die AntikSrper-Antigen-Komplexe durch 
55 Inkubation der Komplexe mit einem markierten Antikorper gegen menschliches Immunglobulin nachge- 

wiesen werden. 
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21. Immunoassay nach Anspruch 20, wobei der AntikSrper gegen mehschliches Immunglobulin enzymmar- 
kiert ist. 

22. Immunoassay nach einem der AnsprUche 16 bis 21, wobei das Polyprotein eine AminosSuresequenz 
s umfafit, die mindestens 60 % Homologie mit der 859 Aminosauren umfassenden Sequehz in Fig; 14 

aufweist. 

23. Immunoassay nach einem der AnsprUche 16 bis 22, wobei die Sequenz mindestens 10 aufeinanderfol- 
gende Aminosauren umfafit. 

10 ' ■ ■:/ / : : '" • " v " . 

24. Immunoassay nach einem der AnsprUche 16 bis 23, wobei die Sequenz mindestens 15 aufeinanderfol- 
gende Aminosauren umfaBt. 

25. Immunoassay nach einem der AnsprUche 16 bis 24, wobei die aufeinanderfolgende Sequenz in Fig. 14 
75 gezeigt ist. 

26. Immunoassay nach einem der AnsprUche 16 bis 24, wobei die aufeinanderfolgende Sequenz in Fig. 47 
gezeigt ist. 

20 27. Immunoassay nach einem der AnsprUche 16 bis 24, wobei die aufeinanderfolgende Sequenz eine 
Sequenz ist, wie sie in einer der Figuren 1, 3 bis 32, 36, 46 oder 47 gezeigt ist, oder deren Sequenz 
von einem Polynucleotid codiert wird, das selektiv mit dem Polynucleotid hybridisierbar ist, wie es in 
einer der Figuren 1 , 3 bis 32, 36, 46 oder 47 gezeigt ist. 

25 28. Immunoassay nach einem der AnsprUche 16 bis 27, wobei die aufeinanderfolgende Sequenz innerhalb 
der lambda-gt11-cDNA-Genbank codiert ist, die bei der American Type Culture Collection (ATCC) unter 
der Hinterlegungs-Nr. 40394 hinterlegt ist. 

29. Immunoassay nach einem der AnsprUche 16 bis 28, wobei die aufeinanderfolgende Sequenz von 
30 einem nichtstrukturellen viralen Protein stammt. 

30. Immunoassay nach einem der AnsprUche 16 bis 24 oder 26 bis 28, wobei die aufeinanderfolgende 
Sequenz von einem struktureiien viralen Protein stammt. : 

35 31. Immobilisiertes Polypeptid zur Verwendung in einem Immunoassay nach einem der AnsprUche 16 bis 
30, wobei das Polypeptid eine antigene Determinate umfaBt, die durch einen anti-HCV-AntikQrper 
gema'B der Definition in Anspruch 16 gebunden wird. 

32. Polynucleotid in im wesentlichen isolierter Form, umfassend eine Sequenz von aufeinanderfolgenden 
40 Nucleotiden, die zur selektiven Hybridisierung mit dem Genom von Hepatitis C-Virus (HCV) oder dem 

Komplement davon fShig ist, wobei HCV folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches eiri Polyprotein codiert; urid 

(iii) das Polyprotein umfaBt eine AminosSuresequenz, die mindestens 40 % Homologie mit der 859 
45 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

33. Polynucleotid nach Anspruch 32, wobei das Polyprotein eine AminosSuresequenz umfaBt, die minde- 
stens 60 % Homologie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

50 34. Polynucleotid nach Anspruch 32 oder 33, wobei die Sequenz mindestens 10 aufeinanderfolgende 
Nucleotide umfaBt. 

35. Polynucleotid nach Anspruch 34, wobei die Sequenz mindestens 15 aufeinanderfolgende Nucleotide 
umfaBt. 

55 : 

36. Polynucleotid nach Anspruch 35, wobei die Sequenz mindestens 20 aufeinanderfolgende Nucleotide 
umfaBt. 


87 


EP 0 318 216 B1 


37. Polynucleotid nach einem der AnsprOche 32 bis 36, das ein DNA-Polynucleotid ist. 

38. Polynucleotid nach einem der AnsprOche 32 bis 36, das ein RNA-Polynucleotid ist. 

5 39. Polynucleotid nach einem der AnsprOche 32 bis 38, welches an eine feste Phase fixiert ist. 

40. Sonde, umfassend ein Polynucleotid nach einem der AnsprOche 32 bis 39, weiterhin einen nachweisba- 
ren Marker umfassend. 

70 41. Testkit, umfassend eine Polynucleotidsonde nach einem der AnsprUche 32 bis 40 in einem geeigneten 
BehSlter. 

42. Polymerasekettenreaktions (PCR)-Kit, umfassend ein Paar von Primern, die zum Primen der Synthese 
von cDNA in einer PCR-Reaktion fahig sind, wobei jeder der Primer ein Polynucleotid nach einem der 

75 Anspruche 32 bis 37 ist. 

43. PCR-Kit nach Anspruch 42, weiterhin eine Polynucleotidsonde umfassend, die zur selektiven Hybridis- 
ierung mit einem Bereich des HCV-Genoms fahig ist, das zwischen den HCV-Sequenzen liegt, von 
denen die Primer stammen, und diese nicht umfaBt. 

20 

44. Verfahren zur Durchfuhrung einer Polymerasekettenreaktion, wobei die Primer ein Paar von Polynucleo- 
tiden nach einem der AnsprOche 32 bis 37 sind. 

45. Verfahren zum Testen einer Probe auf die Gegenwart oder Abwesenheit von HCV-Polynucleotiden, 
25 umfassend: 

(a) Inkontaktbringen der Probe mit einer Sonde, die ein Polynucleotid nach einem der AnsprOche 32 
bis 40 umfaBt, unter Bedingungen, die die selektive Hybridisierung der Sonde mit einem HCV- 
Polynucleotid Oder dem Komplement davon in der Probe ermSglichen; und 

(b) Bestimmung, ob Polynucleotidduplexe gebildet wurden, die die Sonde umfassen. 

30 

46. DNA-Polynucleotid, das ein Polypeptid codiert, wobei das Polypeptid eine Sequenz von mindestens 10 
aufetnanderfolgenden Aminosauren umfaBt, die von dem Genom von Hepatitis C-Virus (HCV) codiert 
wird und eine antigene Determinante umfaBt, wobei HCV folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 
35 (ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein. Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine AminosSuresequenz, die mindestens 40 % Homologie mit der 859 
AminosMuren umfassenden Sequenz in Fig. 14 aufweist. 

47. DNA-Polynucleotid nach Anspruch 46, wobei das Polyprotein eine Anriinosauresequenz umfaBt, die 
40 mindestens 60 % Homologie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

48. DNA-Polynucleotid nach Anspruch 46 oder 47, wobei die Sequenz mindestens 15 aufeinanderfolgende 
Aminosauren codiert. 

45 49. DNA-Polynucleotid nach einem der AnsprOche 46 bis 48, wobei die aufeinanderfolgende Sequenz in 
Fig. 14 gezeigt ist. 

50. DNA-Polynucleotid nach einem der AnsprOche 46 bis 48, wobei die aufeinanderfolgende Sequenz in 
Fig. 47 gezeigt ist. 

50 

51. DNA-Polynucleotid nach einem der AnsprOche 46 bis 48, wobei die aufeinanderfolgende Sequenz 
innerhalb der lambda-gt11-cDNA-Genbank codiert ist, die bei der American Type Culture Collection 
(ATCC) unter der Hinterlegungsnummer 40394 hinterlegt ist. 

55 52. DNA-Polynucleotid nach einem der AnsprOche 46 bis 48, dessen Sequenz in einer der Figuren 1 , 3 bis 
32, 36, 46 oder 47 gezeigt ist, oder dessen Sequenz selektiv mit dem Polynucleotid hybridisierbar ist, 
das in einer der Figuren 1, 3 bis 32, 36, 46 oder 47 gezeigt ist. 
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53. DNA-Polynucleotid nach einem der AnsprQche 46 bis 52, wobei die aufeinahderfolgende Sequenz von 
einem nichtstruktureilen viralen Protein stammi 

54. DNA-Polynucleotid nach einem der AnsprQche 46 bis 48 Oder 50 bis 52, wobei die aufeinanderfolgende 
5 Sequenz von einem strukturellen viralen Protein stammt. 

55. Rekombinanter Vektor, umfassend eine codierende Sequenz, die ein DNA-Polynucleotid nach einem 
der AnsprQche 46 bis 54 umfaBt. 

w 56. Wirtszelle, transformiert mit einem rekombinanten Vektor nach Anspruch 55, wobei die codierende 
Sequenz funktionell mit einer KontroHsequenz verknQpft ist, die die Expression der codierenden 
Sequenz durch die Wirtszelle ermoglicht. 

57. Verfahren zur Herstellung eines rekombinanten HCV-Polypeptids, umfassend die Inkubation einer 
75 Wirtszelle nach Anspruch 56 unter Bedingungen, die zur Expression der codierenden Sequenz fOhren. 

58. Anti-HCV-Antik5rper-Zusammensetzung, umfassend Antik5rper, die die antigene Determinante eines 
Polypeptids nach einem der AnsprQche 1 bis 12 binden, die (a) ein gereinigtes Praparat von 
polyclonalen Anttkorpern oder (b) eine Zusammensetzung aus monoclonalen Antikorpern ist. 

20 

59. Zusammensetzung nach Anspruch 58, wobei die anti-HCV-Antikorper an eine feste Phase fixiert sind. 

60. Immunoassay-Kit, umfassend eine anti-HCV-Antikorper-Zusammensetzung nach Anspruch 58 oder 59 
in einem geeigneten Behalter. 

25 . • 

61. Immunoassay-Verfahren zum Nachweis eines HCV-Antigens in einer Probe, umfassend: 

(a) Bereitstellung einer anti-HCV-Antikorper-Zusammensetzung: nach Anspruch 58 oder 59; 

(b) Inkubation einer Probe mit der anti-HCV-AntikSrper-Zusammensetzung unter Bedingungen, die 
die Ausbildung eines AntikQrper-Antigen-Komplexes erlauben; und 

30 (c) Bestimmung, ob ein den anti-HCV-Antikorper umfassender Antikorper-Antigen-Komplex ausgebil- 

det wurde. 

62. Polypeptid, umfassend eine Sequenz von mindestens 10 aufeinanderfolgenden Aminosauren, die durch 
das Genom von Hepatitis C-Virus (HCV) codiert wird und eine antigene Determinante umfaBt, wobei die 

35 aufeinanderfolgende Sequenz mit einer nicht-HGV-AminosSuresequenz fusioniert ist, und wobei HCV 
folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaGt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindeistens 40 % Homoiogie mit der 859 
40 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

63; Polypeptid nach Anspruch 62, wobei das Polyprotein eine Aminosauresequenz umfaBt, die mindestens 
60 % Homoiogie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

45 64. Polypeptid nach Anspruch 62 oder 63, wobei die nicht-HCV-Aminosauresequenz eine Signalsequenz 
umfaBt. 

65. Polypeptid nach Anspruch 62 oder 63, wobei die nicht-HCV-Aminosauresequenz eine Aminosaurese- 
quenz von 0-Galactosidase oder Superoxiddismutase umfaBt. 

50 

66. Polypeptid nach Anspruch 62 oder 63, wobei die nicht-HCV-Aminosauresequenz ein teilchenbildendes 
Protein umfaBt. 

67. Polypeptid nach Anspruch 66, wobei das teilchenbildende Protein Hepatitis B-OberflSchenantigen 
55 umfaBt. 

68. Polypeptid nach einem der AnsprQche 1 bis 12 oder 62 bis 67 zur Verwendung in einem Verfahren zur 
Herstellung von anti-HCV-Antikorpern, umfassend die Verabreichung des Polypeptids an einen Sauger 
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in einer Menge, die zur Erzeugung einer Immunantwort ausreicht. 

69. Zusammensetzung, umfassend ein Polypeptid nach einem der AnsprOche 62 bis 67 im Gemisch mit 
einem pharmazeutisch vertraglichen Excipienten. 

5 

70. Impfstoff nach Anspruch 69. 

71. Verfahren zur ZOchtung von Hepatitis C-Virus (HCV), umfassend die Bereitstellung von mit HCV 
infizierten Zellen und Vermehrung dieser Zellen in vitro , wobei HCV folgendermaBen charakterisiert ist: 

10 (i) ein positiver Strang des RNA-Genoms, 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Poly protein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 859 
Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

75 72. Verfahren nach Anspruch 71, wobei das Polyprotein eine Aminosauresequenz umfaBt, die mindestens 
60 % Homologie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

73. Verfahren nach Anspruch 71 oder 72, wobei die Zellen primare Zellen umfassen. 

20 74. Verfahren nach Anspruch 71 oder 72, wobei die Zellen eine Zellinie umfassen. 

75. Verfahren nach einem der Ansprtiche 71 bis 74, wobei die Zellen Hepatocyten oder Macrophagen sind. 

76. Hepatitis C-Virus (HCVJ-lmmunoassay-Antigen, fixiert an eine feste Phase, wobei HCV folgendermaBen 
25 charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 859 
AminosSuren umfassenden Sequenz in Fig. 14 aufweist; 

30 wobei das Antigen eine antigene Determinante umfaBt, die immunologisch reaktiv mit einem anti-HCV- 
Antikorper ist, wobei 

(a) der anti-HCV-Antikorper immunologisch reaktiv mit einer als Bezug dienenden antigenen Deter- 
minante ist, die (i) von einer HCV-cDNA-lnsertion in der lambda-gt11-Genbank codiert wird, welche 
bei der American Type Culture Collection (ATCC) unter der Hinterlegungs-Nr. 40394 hinterlegt ist, 

35 oder (ii) in Figur 47 gezeigt ist; und 

(b) die als Bezug dienende antigene Determinante immunologisch reaktiv mit Seren von HCV- 
infizierten Menschen ist. 

77. HCV-Antigen nach Anspruch 76, wobei die als Bezug dienende antigene Determinante in Fig. 14 
40 gezeigt ist. 

Patentanspruche fur folgenden Vertragsstaat : ES 

1. Immunoassay zum Nachweis eines AntikcSrpers gegen Hepatitis C-Virus (HCV) (anti-HCV-Antik8rper), 
45 wobei der Immunoassay folgende Schritte umfaBt: 

(a) Bereitstellung eines Polypeptids, umfassend eine antigene Determinante, an die der anti-HCV- 
Antikorper bindet, wobei die antigene Determinante eine aufeinanderfolgende Aminosauresequenz 
umfaBt, die von dem HCV-Genom codiert wird, und wobei HCV folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 
so (ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 
859 Aminosauren umfassenden Sequenz in Fig. 14 aufweist, 

(b) Inkubation einer biologischen Probe mit dem Polypeptid unter Bedingungen, die die Ausbildung 
des AntikSrper-Antigen-Komplexes erlauben; und 

55 (c) Bestimmung, ob der das Polypeptid umfassende Antikdrper-Antigen-Komplex ausgebildet wurde. 

2. Immunoassay nach Anspruch 1 , wobei das Polypeptid durch Expression rekombinanter DNA hergestellt 
wurde. 
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3. Immunoassay nach Anspruch 1, wobei das Polypeptid durch chemische Synthese hergestellt wurde. 

4. Immunoassay nach einem der AnsprGche 1 bis 3, wobei das Polypeptid an einen festen Trager 
geknupft ist. 

5 

5. Immunoassay nach einem der AnsprGche 1 bis 4, wobei die Antikorper-Antigen-Komplexe durch 
Inkubation der Komplexe mit einem markierten Antikorper gegen menschliches Immunglobulin nachge- 
wiesen werden. 

w 6. Immunoassay nach Anspruch 5, wobei der Antikorper gegen menschliches Immunglobulin enzymmar- 
kiert ist. 

7. Immunoassay nach einem der AnsprGche 1 bis 6, wobei das Polyprotein eine AminosSuresequenz 
umfaBt, die mindestens 60 % Homologie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 

75 aufweist. 

8. Immunoassay nach einem der AnsprGche 1 bis 7, wobei die Sequenz mindestens 10 aufeinanderfol- 
gende Aminosauren umfaBt. 

20 9. Immunoassay nach einem der AnsprGche 1 bis 8, wobei die Sequenz mindestens 15 aufeinanderfol- 
gende Aminosauren umfaBt. 

10. Immunoassay nach einem der AnsprGche 1 bis 9, wobei die aufeinanderfolgende Sequenz in Fig. 14 
gezeigt ist. 

25 

11. Immunoassay nach einem der AnsprGche 1 bis 9, wobei die aufeinanderfolgende Sequenz in Fig. 47 
gezeigt ist. 

12. Immunoassay nach einem der AnsprGche 1 bis 9, wobei die aufeinanderfolgende Sequenz eine 
30 Sequenz ist, wie sie in einer der Figuren 1 , 3 bis 32, 36, 46 Oder 47 gezeigt ist, Oder deren Sequenz 

von einem Polynucleotid codiert wird, das selektiv mit dem Polynucleotid hybridisierbar ist, wie es in 
einer der Figuren 1 , 3 bis 32, 36, 46 oder 47 gezeigt ist. 

13. Immunoassay nach einem der AnsprGche 1 bis 12, wobei die aufeinanderfolgende Sequenz innerhalb 
35 der lambda-gt11-cDNA-Genbank codiert ist, die bei der American Type Culture Collection (ATCC) unter 

der Hinterlegungs-Nr. 40394 hinterlegt ist. 

14. Immunoassay nach einem der AnsprGche 1 bis 13, wobei die aufeinanderfolgende Sequenz von einem 
nichtstrukturellen viralen Protein stammt. 

40 

15. Immunoassay nach einem der AnsprGche 1 bis 9 oder 11 bis 13, wobei die aufeinanderfolgende 
Sequenz von einem strukturellen viraien Protein stammt. 

16. Verfahren zur DurchfGhrung einer Polymerasekettenreaktion, wobei die Primer ein Paar von Polynucleo- 
45 tiden sind, wobei die Polynucleotide jeweils eine Sequenz von aufeinanderfolgenden Nucleotiden 

umfassen, die zur selektiven Hybridisierung mit dem Genom von Hepatitis C- Virus (HCV) oder dem 
Komplement davon fShig ist, wobei HCV folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

so (iii) das Polyprotein umfaBt eine AminosSuresequenz, die mindestens 40 % Homologie mit der 859 

Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

17. Verfahren nach Anspruch 16, wobei das Polyprotein eine AminosSuresequenz umfaBt, die mindestens 
60 % Homologie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

55 

18. Verfahren nach Anspruch 16 oder 17, wobei die Polynucleotidsequenz mindestens 10 aufeinanderfol- 
gende Nucleotide umfaBt. 
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19. Verfahren nach Anspruch 18, wobei die Polynucleotidsequenz mindestens 15 aufeinanderfolgende 
Nucleotide umfaBt. 

20. Verfahren nach Anspruch 19, wobei die Polynucleotidsequenz mindestens 20 aufeinanderfolgende 
5 Nucleotide umfaBt. 

21. Verfahren nach einem der Anspruche 16 bis 20, wobei das Polynucleotid ein DNA-Polynucleotid ist. 

22. Verfahren nach einem der AnsprOche 16 bis 21, weiterhin umfassend die Verwendung einer Polynu- 
10 cleotidsonde, die zur selektiven Hybridisierung mit einem Bereich des HCV-Genoms fahig ist, das 

zwischen den HCV-Sequenzen liegt, von denen die Primer stammen, und diese nicht umfaBt. 

23. Verfahren zum Testen einer Probe auf die Gegenwart Oder Abwesenheit von HCV-Polynucleotiden, 
umfassend: 

75 (a) Inkontaktbringen der Probe mit einer Sonde, die ein Polynucleotid gemaS der Definition in einem 

der Anspruche 16 bis 21 umfaBt, unter Bedingungen, die die selektive Hybridisierung der Sonde mit 
einem HCV-Polynucleotid Oder dem Komplement davon in der Probe ermSglichen; und 
(b) Bestimmung, ob Polynucleotidduplexe gebildet wurden, die die Sonde umfassen. 

20 24. Verfahren nach Anspruch 23, wobei die Sonde ein RNA-Polynucleotid ist. 

25. Verfahren nach Anspruch 23 oder 24, wobei die Sonde an eine teste Phase fixiert ist. 

26. Verfahren nach einem der Anspruche 23 bis 25, wobei die Sonde weiterhin eirien nachweisbaren 
25 Marker umfaBt. 

27. Verfahren zur Herstellung eines rekombinanten HCV-Polypeptids, umfassend die Inkubation einer mit 
einem rekombinierten Vektor transformierten Wirtszelle, wobei der Vektor eine codierende Sequenz 
eines DNA-Polynucleotids, das ein Polypeptid codiert, umfaBt, wobei das Polypeptid eine Sequenz von 

30 mindestens 10 aufeinanderfolgenden AminosSuren umfaBt, dje von dem Genom von Hepatitis C^Virus 
(HCV) codiert wird und eine antigene Determinante umfaBt, wobei HCV folgendermaBen charakterisiert 
ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

35 (iii) das Polyprotein umfaBt eine AminosSuresequenZj die mindestens 40 % Homologie mit der 859 

AminosSuren umfassenden Sequenz in Fig. 14aufweist, 
wobei die codierende Sequenz funktionell mit einer Kontrollsequenz verknupft ist, die die Expression 
der codierenden Sequenz durch die Wirtszelle unter Bedingungen ermoglicht, die zur Expression der 
codierenden Sequenz fUhren. 

40 

28. Verfahren nach Anspruch 27, wobei das Polyprotein eine AminosSuresequenz umfaBt, die mindestens 
60 % Homologie mit der 859 AminosSuren umfassenden Sequenz in Fig. 14 aufweist. . 

29. Verfahren nach Anspruch 27 oder 28, wobei die Sequenz mindestens 15 aufeinanderfolgende Amino- 
45 sauren codiert. 

30. Verfahren nach einem der AnsprOche 27 bis 29, wobei die aufeinanderfolgende Sequenz in Fig. 14 
gezeigt ist. 

50 31. Verfahren nach einem der AnsprOche 27 bis 29, wobei die aufeinanderfolgende Sequenz in Fig. 47 
gezeigt ist. 

32. Verfahren nach einem der AnsprOche 27 bis 29, wobei die aufeinanderfolgende Sequenz innerhalb der 
lambda-gt11-cDNA-Genbank codiert ist, die bei der American Type Culture Collection (ATCC) unter der 

55 Hinterlegungsnummer 40394 hinterlegt ist. 

33. Verfahren nach einem der AnsprOche 27 bis 29, wobei die DNA-Polynucleotidsequenz wie in einer der 
Figuren 1, 3 bis 32, 36, 46 oder 47 gezeigt ist, Oder deren Sequenz selektiv mit dem Polynucleotid 
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hybridisierbar ist, das in einer der Figuren 1, 3 bis 32, 36, 46 Oder 47 gezeigt ist. 

34. Verfahren nach einem der AnsprUche 27 bis 33, wobei die aufeinanderfolgende Sequenz von einem 
nichtstrukturellen viralen Protein stammt. 

s 

35. DNA-Polynucleotid nach einem der AnsprUche 27 bis 29 Oder 31 bis 33, wobei die aufeinanderfolgende 
Sequenz von einem strukturellen viralen Protein stammt. 

36. Verfahren, umfassend das Mischen eines Polypeptids in im wesentlichen isolierter Form, hergestellt 
io gemaB dem Verfahren nach einem der AnsprUche 26 bis 35, Oder ein Polypeptid mit der gleichen 

Sequenz, das durch chemische Synthese hergestellt wurde, mit eiriem pharmazeutisch vertrSglichen 
Excipienten. 

37. Verfahren nach Anspruch 36 fur die Herstellung eines Impfstoffs. 

75 

38. Verfahren nach Anspruch 27 oder 28, wobei die aufeinanderfolgende Sequenz mit einer nicht-HCV- 
AminosSuresequenz fusioniert ist. 

39. Verfahren nach Anspruch 38, wobei die nicht-HCV-Aminosauresequenz eine Signalsequenz umfaBt. 

20 

40. Verfahren nach Anspruch 38, wobei die nicht-HCV-Aminosauresequenz eine Aminosauresequenz yon 
0-Galactosidase oder Superoxiddismutase umfaBt. 

41. Verfahren nach Anspruch 38, wobei die nicht-HCV-Aminosauresequenz ein teilchenbildendes Protein 
25 umfaBt. 

42. Verfahren nach Anspruch 41, wobei das teilchenbildende Protein Hepatitis B-Oberf lachenantigen 
umfaBt. 

30 43. Verfahren, umfassend das Mischen eines Polypeptids, hergestellt gemSB dem Verfahren nach einem 
der AnsprUche 38 bis 42, mit einem pharmazeutisch vertraglichen Excipienten. 

44. Verfahren nach Anspruch 43 fUr die Herstellung eines Impfstoffs. 

35 45. Immunoassay-Verfahren zum Nachweis eines HCV-Antigens in einer Probe, umfassend: 

(a) BereitsteNung einer Anti-HCV-Antik6rper-Zusammensetzung, umfassend AntikSrper, die die anti- 
gene Determinante eines Polypeptids nach einem der AnsprUche 1 bis 4 oder 7 bis 15 binden, die 
(a) ein gereinigtes PrSparat von polyclonalen Antikorpern oder (b) eine Zusammensetzung aus 
monoclpnalen Antikorpern ist; 

40 (b) Inkubation einer Probe mit der anti-HCV-Antikorper-Zusammensetzung unter Bedingungen, die 

die Ausbildung eines Antik5rper-Antigen-Komplexes erlaubeh; und 

(c) Bestimmung, ob ein den anti-HCV-Antikorper umfassender Antikorper-Antigen-Komplex ausgebil- 
det wurde. 

45 46. Verfahren nach Anspruch 45, wobei die anti-HCV-Antikorper an eine feste Phase fixiert sind. 

47. Verfahren zur ZUchtung von Hepatitis C-Virus (HCV), umfassend die BereitsteNung von mit HCV 
infizierten Zellen und Vermehrung dieser Zellen in vitro , wobei HCV folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms, 
so (ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine AminosSuresequenz, die mindestens 40 % Homologie mit der 859 
Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

48. Verfahren nach Anspruch 47, wobei das Polyprotein eine AminosSuresequenz umfaBt, die mindestens 
55 60 % Homologie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

49. Verfahren nach Anspruch 47 oder 48, wobei die Zellen primare Zellen umfassen. 
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50. Verfahren nach Anspruch 47 oder 48, wobei die Zellen eine Zellinie umfassen. 

51- Verfahren nach einem der AnsprOche 47 bis 50, wobei die Zellen Hepatocyten oder Macrophagen sind. 

5 52. Verfahren zur Herstellung eines Hepatitis C-Virus (HCV)-lmmunoassay-Antigens, wobei HCV folgender- 
maSen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine AminosSuresequenz, die mindestens 40 % Homoiogie mit der 859 
io Aminosauren umfassenden Sequenz in Fig. 14 aufweist; 

wobei das Antigen eine antigene Determinante umfaSt, die immunologisch reaktiv mit einem anti-HCV- 
Antikorper ist, wobei 

(a) der anti-HCV-Antikorper immunologisch reaktiv mit einer als Bezug dienenden antigenen Deter- 
minante ist, die (i) von einer HCV-cDNA-lnsertion in der lambda-gt11-Genbank codiert wird, welche 

75 bei der American Type Culture Collection (ATCC) unter der Hinterlegungs-Nr. 40394 hinterlegt ist, 

oder (ii) in Figur 47 gezeigt ist; und 

(b) die als Bezug dienende antigene Determinante immunologisch reaktiv mit Seren von HCV- 
infizierten Menschen ist; 

umfassend die Fixierung des Immunoassay-Antigens an eine feste Phase. 

20 

53. Verfahren nach Anspruch 52, wobei die als Bezug dienende antigene Determinante in Fig. 14 gezeigt 
ist. 

Patentanspriiche fur folgenden Vertragsstaat : GR 

25 

1. Immunoassay zum Nachweis eines AntikoVpers gegen Hepatitis C-Virus (HCV) (anti-HCV-Antikorper), 
wobei der Immunoassay folgende Schritte umfaBt: 

(a) Bereitstellung eines Polypeptids, umfassend eine antigene Determinante, an die der anti-HCV- 
Antikdrper bindet, wobei die antigene Determinante eine aufeinanderfolgende Aminosauresequenz 
30 umfaBt, die von dem HCV-Genom codiert wird, und wobei HCV folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homoiogie mit der 
859 Aminosauren umfassenden Sequenz in Fig. 1 4 aufweist, 

35 (b) Inkubation einer biologischen Probe mit dem Polypeptid unter Bedingungen, die die Ausbildung 

des AntikSrper- Antigen- Komplexes erlauben; und 

(c) Bestimmung, ob der das Polypeptid umfassende AntikSrper-Antigen-Komplex ausgebildet wurde. 

2. Immunoassay nach Anspruch 1, wobei das Polypeptid durch Expression rekombinanter DNA hergestellt 
40 wurde. 

3. Immunoassay nach Anspruch 1 , wobei das Polypeptid durch chemische Synthese hergestellt wurde. 

4. Immunoassay nach einem der AnsprUche 1 bis 3, wobei das Polypeptid an einen festen TrSger 
45 geknUpft ist 

5. Immunoassay nach einem der AnsprOche 1 bis 4, wobei die Antik5rper- Antigen- Komplexe durch 
Inkubation der Komplexe mit einem markierten Antikorper gegen menschliches Immunglobulin nachge- 
wiesen werden. 

50 

6. Immunoassay nach Anspruch 5, wobei der Antikorper gegen menschliches Immunglobulin enzymmar- 
kiert ist. 

7. Immunoassay nach einem der AnsprOche 1 bis 6, wobei das Polyprotein eine Aminosauresequenz 
55 umfaGt, die mindestens 60 % Homoiogie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 

aufweist. 
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8. Immunoassay nach einem der AnsprUche 1 bis 7, wobei die Sequenz mindestens 10 aufeinanderfol- 
gende Aminosauren umfaBt 

9. Immunoassay nach einem der AnsprUche 1 bis 8, wobei die Sequenz mindestens 15 aufeinanderfol- 
5 gende Aminosauren umfaBt 

10. Immunoassay nach einem der AnsprUche 1 bis 9, wobei die aufeinanderfolgende Sequenz in Fig. 14 
gezeigt ist. 

io 11. Immunoassay nach einem der AnsprUche 1 bis 9, wobei die aufeinanderfolgende Sequenz in Fig. 47 
gezeigt ist. 

12. Immunoassay nach einem der AnsprUche 1 bis 9, wobei die aufeinanderfolgende Sequenz eine 
Sequenz ist, wie sie in einer der Figuren 1, 3 bis 32, 36, 46 oder 47 gezeigt ist, Oder deren Sequenz 

75 von einem Polynucleotid codiert wird, das selektiv mit dem Polynucleotid hybridisierbar ist, wie es in 
einer der Figuren 1 , 3 bis 32, 36, 46 oder 47 gezeigt ist. 

13. Immunoassay nach einem der AnsprUche 1 bis 12, wobei die aufeinanderfolgende Sequenz innerhalb 
der Iambda-gt1 1-cDNA-Genbank codiert ist, die bei der American Type Culture Collection (ATCC) unter 

20 der Hinterlegungs-Nr. 40394 hinterlegt ist. 

14. Immunoassay nach einem der AnsprUche 1 bis 13, wobei die aufeinanderfolgende Sequenz von einem 
nichtstrukturellen viralen Protein stammt 

25 15. Immunoassay nach einem der AnsprUche 1 bis 9 oder 11 bis 13, wobei die aufeinanderfolgende 
Sequenz von einem strukturellen viralen Protein stammt 

16. Verfahren zur Durchfuhrung einer Polymerasekettenreaktion, wobei die Primer ein Paar von Poiynucleo- 
tiden sind, wobei die Polynucleotide jeweils eine: Sequenz von aufeinanderfolgenden Nucleotiden 

30 umfassen, die zur selektiven Hybridisierung mit dem Genom von Hepatitis C-Virus (HCV) oder dem 
Komplement davon fahig ist, wobei HCV folgendermafien charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein off enes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine AminosSuresequenz, die mindestens 40 % Homologie mit der 859 
35 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

17. Verfahren nach Anspruch 16, wobei das Polyprotein eine AminosSuresequenz umfafit, die mindestens 
60 % Homologie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

40 18. Verfahren nach Anspruch 16 oder 17, wobei die Polynucleotidsequenz mindestens 10 aufeinanderfol- 
gende Nucleotide umfaBt. 

19. Verfahren nach Anspruch 18, wobei die Polynucleotidsequenz mindestens 15 aufeinanderfolgende 
Nucleotide umfaBt. 

45 

20. Verfahren nach Anspruch 19, wobei die Polynucleotidsequenz mindestens 20 aufeinanderfolgende 
Nucleotide umfaBt. 

21. Verfahren nach einem der AnsprUche 16 bis 20, wobei das Polynucleotid ein DNA-Polynucleotid ist. 

50 

22. Verfahren nach einem der AnsprUche 16 bis 21, weiterhin umfassend die Verwendung einer Polynu- 
cleotidsonde, die zur selektiven Hybridisierung mit einem Bereich des HCV-Genoms fahig ist, das 
zwischen den HCV-Sequenzen liegt, von denen die Primer stam men, und diese nicht umfaBt 

55 23. Verfahren zum Testen einer Probe auf die Gegenwart oder Abwesenheit von HCV-Polynucleotiden, 
umfassend: 

(a) Inkontaktbringen der Probe mit einer Sonde, die ein Polynucleotid gemaB der Definition in einem 
der AnsprUche 16 bis 21 umfaBt, unter Bedingungen, die die selektive Hybridisierung der Sonde mit 
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einem HCV-Polynucleotid Oder dem Komplement davon in der Probe ermoglichen; und 
(b) Bestimmung, ob Polynucleotidduplexe gebildet wurden, die die Sonde umfassen. 

24. Verfahren nach Anspruch 23, wobei die Sonde ein RNA-Polynucleotid ist. 

5 

25. Verfahren nach Anspruch 23 oder 24, wobei die Sonde an eine teste Phase fixiert ist. 

26. Verfahren nach einem der AnsprGche 23 bis 25, wobei die Sonde weiterhin einen nachweisbaren 
Marker umfaBt. 

70 " 

27. Verfahren zur Herstellung eines rekombinanten HCV-Polypeptids; umfassend die lnkut>ation einer mit 

einem rekombinierten Vektor transformierten Wirtszelle, wobei der Vektor eine codierende Sequenz 
eines DNA-Polynucleotids, das ein Polypeptid codiert, umfaSt t wobei das Polypeptid eine Sequenz von 
mindestens 10 aufeinanderfolgenden Aminosauren umfaSt, die von dem Genom von Hepatitis C-Virus ; •' 
15 (HCV) codiert wird und eine antigene Determinante umfaBt, wobei HCV folgendermaflen charakterisiert 
ist: .. . 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaSt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(tii) das Polyprotein umfaflt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 859 
20 AminosSuren umfassenden Sequenz in Fig. 14 aufweist, 

wobei die codierende Sequenz funktionell mit einer Kontroilsequenz yerknupft ist, die die Expression 
der codierenden Sequenz durch die Wirtszelle unter Bedingungen ermoglicht, die zur Expression der 
codierenden Sequenz fuhren. 

25 28. Verfahren nach Anspruch 27, wobei das Polyprotein eine Aminosauresequenz umfaBt, die mindestens 
60 % Homologie mit der 859 AminosSureVi umfassenden Sequenz in Fig. 14 aufweist. 

29. Verfahren nach Anspruch 27 Oder 28, wobei die Sequenz mindestens 15 aufeinanderfolgende Amino- 
sSuren codiert. 

30 . " - ':' '•■ - 

30. Verfahren nach einem der AnsprGche 27 bis 29, wobei die aufeinanderfolgende Sequenz in Fig. 14 
gezeigt ist 

31. Verfahren nach einem der AnsprGche 27 bis 29, wobei die aufeinanderfolgende Sequenz in Fig. 47 
35 gezeigt ist. 

32. Verfahren nach einem der AnsprUche 27 bis 29, wobei die aufeinanderfolgende Sequenz innerhalb der 
lambda-gtl 1 -cDNA-Genbank codiert 1st, die bei der American Type Culture Collection (ATCC) unter der 
Hinterlegungsnummer 40394 hinterlegt ist. 

40 

33. Verfahren nach einem der AnsprGche 27 bis 29, wobei die DNA-Polynucleotidsequenz wie in einer der 
Figuren 1, 3 bis 32, 36, 46 oder 47 gezeigt ist, oder deren Sequenz selektiv mit dem Polynucleotid ; 
hybridisierbar ist, das in einer der Figuren 1, 3 bis 32, 36, 46 oder 47 gezeigt ist. 

45 34. Verfahren nach einem der AnsprGche 27 bis 33, wobei die aufeinanderfolgende Sequenz von einem 
nichtstrukturellen viralen Protein stammt. 

35. DNA-Polynucleotid nach einem der AnsprOche 27 bis 29 oder 31 bis 33, wobei die aufeinanderfolgende 
Sequenz von einem strukturellen viralen Protein stammt. 

50 

36. Verfahren, umfassend das Mischen eines Polypeptids in im wesentlichen isolierter Form, hergestellt 
gemaB dem Verfahren nach einem der AnsprGche 26 bis 35, oder ein Polypeptid mit der gleichen 
Sequenz, das durch chemische Synthese hergestellt wurde, mit einem pharmazeutisch vertrSglichen 
Excipienten. 

55 

37. Verfahren nach Anspruch 36 fOr die Herstellung eines Impfstoffs. 
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38. Verfahren nach Anspruch 27 oder 28, wobei die aufeinanderfolgende Sequenz mit einer nicht-HCV- 
AminosSuresequenz fusioniert ist. 

Verfahren nach Anspruch 38, wobei die nicht-HCV-AminosSuresequenz eine Signalsequenz umfaBt. 

Verfahren nach Anspruch 38, wobei die nicht-HCV-Aminosauresequenz eine Aminosauresequenz von 
£-Galactosidase oder Superoxiddismutase umfaBt. 

41. Verfahren nach Anspruch 38, wobei die nicht-HCV-AminosSuresequenz ein teilchenbildendes Protein 
70 umfaBt. 

42. Verfahren nach Anspruch 41, wobei das teilchenbildende Protein Hepatitis B-Oberflachenantigen 
umfaBt. 

75 43. Verfahren, umfassend das Mischen eines Polypeptids, hergestellt gernaB dem Verfahren nach einem 
der AnsprOche 38 bis 42, mit einem pharmazeutisch vertraglichen Excipienten. 

44. Verfahren nach Anspruch 43 fur die Herstellung eines Impfstoffs. 

20 45. I mmunoassay-Verf ah re n zum Nach weis eines HCV- Antigens in einer Probe, umfassend: 

(a) Bereitstellung einer Anti-HCV-Antikorper-Zusammensetzung, umfassend Antikorper, die die anti- 
gene Determinante eines Polypeptids nach einem der AnsprOche 1 bis 4 oder 7 bis 1 5 binden, die 
(a) ein gereinigtes Praparat von polyclonalen Antikdrpern oder (b) eine Zusammensetzung aus 
monoclonalen Antikorpern ist; 

25 (b) Inkubation einer Probe mit der anti-HCV-Antikorper-Zusammensetzung unter Bedingungen, die 

die Ausbildung eines Antikorper-Antigen-Komplexes erlauben; und 

(c) Bestimmung, ob ein den anti-HCV- Antikorper umfassender Antikorper- Antigen- Komplex ausgebil- 
det wurde. 

30 46. Verfahren nach Anspruch 45, wobei die anti-HCV- Antikorper an eine feste Phase fixiert sind. 

Verfahren zur ZOchtung von Hepatitis C-Virus (HCV), umfassend die Bereitstellung von mit HCV 
infizierten Zellen und Vermehrung dieser Zellen in vitro, wobei HCV folgendermafien charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms, 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine AminosSuresequenz, die mindestens 40 % Homologie mit der 859 
Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

Verfahren nach Anspruch 47, wobei das Polyprotein eine Aminosauresequenz umfaBt, die mindestens 
60 % Homologie mit der 859 Aminosauren umfassenden Sequenz in Fig. 14 aufweist. 

Verfahren nach Anspruch 47 oder 48, .wobei die Zellen primare Zellen umfassen. 

50. Verfahren nach Anspruch 47 oder 48, wobei die Zellen eine Zellinie umfassen. 

45 

51. Verfahren nach einem der AnsprOche 47 bis 50, wobei die Zellen Hepatocyten oder Macrophagen sind. 

52. Verfahren zur Herstellung eines Hepatitis C-Virus (HCV)-lmmunoassay-Antigens, wobei HCV folgender- 
maBen charakterisiert ist: 

50 (i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 859 
AminosSuren umfassenden Sequenz in Fig. 14 aufweist; 

wobei das Antigen eine antigene Determinante umfaBt, die immunologisch reaktiv mit einem anti-HCV- 
55 Antikorper ist, wobei 

(a) der anti-HCV-Antikfirper immunologisch reaktiv mit einer als Bezug dienenden antigenen Deter- 
minante ist, die (i) von einer HCV-cDN A- Insertion in der lambda-gt11-Genbank codiert wird, welche 
bei der American Type Culture Collection (ATCC) unter der Hinterlegungs-Nr. 40394 hinterlegt ist, 


39. 

5 

40. 


47. 

35 


48. 

40 

49. 
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20 


Oder (ii) in Figur 47 gezeigt ist; und 

(b) die als Bezug dienende antigene Determinante immunologisch reaktiv mit Seren von HCV- 
infizierten Menschen ist; 
umfassend die Fixierung des Immunoassay^ Antigens an eine teste Phase. 

5 

53. Verfahren nach Anspruch 52, wobei die als Bezug dienende antigene Determinante in Fig. 14 gezeigt 
ist. 

54. Immobilisiertes Polypeptid zur Verwendung in einem Immunoassay nach einem der AnsprOche 1 bis 
jo 15. wobei das Polypeptid eine antigene Determinante umfaBt, die durch einen anti-HCV-Antikdrper 

gemSB der Definition in Anspruch 1 gebunden wird. 

55. Polypeptid in im wesentlichen isolierter Form, umfassend eine Sequenz von mindestens 10 aufeinan- 
derfolgenden Aminosauren, die von dem Genom des Hepatitis C-Virus (HCV) codiert wird und eine 

15 antigene Determinante umfaBt, wobei HCV charakterisiert ist durch: 

(i) einen positiven Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 859 
Aminosauren umfassenden Sequenz in Figur 14 aufweist. 


56. Polypeptid nach Anspruch 55, gemaS der Definition in einem der AnsprOche 2 bis 4 Oder 7 bis 15. 


57. Polynucleotid in im wesentlichen isolierter Form, umfassend eine Sequenz von aufeinanderfolgenden 
Nucleotides die zur selektiven Hybridisierung mit dem Genom yon Hepatitis C-Virus (HCV) oder dem 

25 Komplement davon fahig ist, wobei HCV folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 859 
AminosSuren umfassenden Sequenz in Fig. 14 aufweist. 

30 

58. ' Polynucleotid nach Anspruch 57, gemaB der Definition in einem der AnsprOche 17 bis 21 oder 24 bis 
26. 

59. Testkit, umfassend eine Polynucleotidsonde nach Anspruch 58 in einem geeigneten Behalter. 

35 ' " ' - ; • . " ; 

60. Polymerasekettenreaktions(PCR)-Kit, umfassend ein Paar von Primern, die zum Primen der Synthese 
von cDNA in einer PCR-Reaktion fahig sind, wobei jeder der Primer ein Polynucleotid gemSfl der 
Definition in einem der AnsprOche 16 bis 21 ist. 

40 61. PCR-Kit nach Anspruch 60, weiterhin eine Polynucleotidsonde umfassend, die: zur selektiven Hybridis- 
ierung mit einem Bereich des HCV-Genoms fahig ist, das zwischen den HCV-Sequenzen liegt, von 
denen die Primer stammen, und diese nicht umfaBt. 

62. DNA-Polynucleotid, das ein Polypeptid codiert, wobei das Polypeptid eine Sequenz von mindestens 10 
45 . aufeinanderfolgenden Aminosauren umfaBt, die von dem Genom von Hepatitis C-Virus (HCV) codiert 

wird und eine antigene Determinante umfaBt, wobei HCV folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 859 
so AminosSuren umfassenden Sequenz in Fig. 14 aufweist. 

63. DNA-Polynucleotid nach Anspruch 62, gemaB der Definition in einem der AnsprOche 28 bis 35. 

64. Rekombinanter Vektor, umfassend eine codierende Sequenz, die ein DNA-Polynucleotid nach einem 
55 der AnsprOche 62 oder 63 umfaBt. 

65. Wirtszelle, transformiert mit einem rekombinanten Vektor nach Anspruch 64, wobei die codierende 
Sequenz funktionell mit einer Kontrollsequenz verknOpft ist, die die Expression der codierenden 
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Sequenz durch die Wirtszelle ermftglicht. 

66. Anti-HCV-Antikorper-Zusammensetzuhg, umfassend Antikorper, die die antigene Determinante eines 
Polypeptids nach einem der AnsprOche 55 Oder 56 bindeh, die (a) ein gereinigtes PrSparat von 

s polyclonalen Antikorpern Oder (b) eine Zusammensetzung aus monoclonalen Antikorpern ist. 

67. Mittel nach Anspruch 66, wobei die anti-HCV-Antik5rper an eine feste Phase fixiert sind. 

68. Immunoassay-Kit, umfassend eine anti-HCV-AntikSrper-Zusammensetzung nach Anspruch 66 Oder 67 
;o in einem geeigneten Behalter. 

69. Polypeptid, umfassend eine Sequenz von mindestens 10 aufeinanderfolgenden Aminosauren, die durch 
das Genom von Hepatitis C-Virus (HCV)' codiert wird und eine antigene Determinante umfaBt, wobei die 
aufeinanderfolgende Sequenz mit einer nicht-HCV-AminosauresequenZ; fusioniert ist, und wobei HCV 

75 folgendermaBen charakterisiert ist: 

(i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 859 
Aminosauren umfassenden Sequenz in Fig. 14 aufweist 


20 


70. Polypeptid nach Anspruch 69, gema'B der Definition in einem der AnsprOche 39 bis 42. 


71. Hepatitis C-Virus (HCV)-lmmunoassay-Antigen, fixiert an eine feste Phase, wobei HCV folgendermaBen 
charakterisiert ist: 

25 (i) ein positiver Strang des RNA-Genoms; 

(ii) das Genom umfaBt ein offenes Leseraster (ORF), welches ein Polyprotein codiert; und 

(iii) das Polyprotein umfaBt eine Aminosauresequenz, die mindestens 40 % Homologie mit der 859 
Aminosa'uren umfassenden Sequenz in Fig. 14 aufweist; 

wobei das Antigen eine antigene Determinante umfaBt, die immunologisch reaktiv mit einem anti-HCV- 

30 Antikorper ist, wobei 

(a) der anti-HCV-Antikorper immunologisch reaktiv mit einer als Bezug dienenden antigenen Deter- 
minante ist, die (i) von einer HCV-cDNA-lnsertion in der lambda-gt11-Genbank codiert wird, welche 
bei der American Type Culture Collection (ATCC) unter der Hinterlegungs-Nr. 40394 hinterlegt ist, 
Oder (ii) in Figur 47 gezeigt ist; und 

35 (b) die als Bezug dienende antigene Determinante immunologisch reaktiv mit Seren von HCV- 

infizierten Menschen ist. 

72. HCV-Antigen nach Anspruch 71, wobei die als Bezug dienende antigene Determinante in Fig. 14 
gezeigt ist. 

40 

Revendications 

Revendications pour les Etats contractants sulvants : AT, BE, CH, DE, FR, GB, IT, LI, LU, NL, SE 

1. Polypeptide sous forme pratiquement isolSe, comprenant une sequence contigue" d'au moins 10 
45 aminoacides code*e par le ge>iome du virus de l' hepatite C (HCV) et comprenant un determinant 

antig^nique, le HCV 4tant caracterise par: 

(I) un genome a ARN form^ d'une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyprotSine; et 

(III) ladite polyproteMne comprenant une sequence d'aminoacides ayant. une homologie d'au moins. 
so 40 % avec la sequence de 859 aminoacides sur la figure 14. 

2. Polypeptide selon la revendication 1, dans lequel ladite polyproteine comprend une sequence d'ami- 
noacides ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la figure 14. 

55 3. Polypeptide selon la revendication 1 ou 2, comprenant au moins 1 5 aminoacides. 

4. Polypeptide selon Tune quelconque des revendications pr^c^dentes, prod uit par expression d'ADN 
recombinant. 
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5. Polypeptide selon Tune quelconque des revendications 1 a 3, prepare* par synthese chimique. 

6. Polypeptide selon Tune quelconque des revendications 1 a 5, dans lequel ladite sequence contigue est 
donne*e sur la figure 14. 

5 

7. Polypeptide selon Tune quelconque des revendications 1 a 5, dans lequel ladite sequence contigue est 
donne*e sur la figure 47. 

8. Polypeptide selon rune quelconque des revendications 1 a 5, dans lequel ladite sequence contigue est 
to codee au sein de la banque d'ADNc de \-gt1 1, d^pos^e a V American Type Culture Collection -. 

(ATCC) sous le n° 40394, 

9. Polypeptide selon Tune quelconque des revendications 1 a 8, dans lequel ladite sequence contigue 
provient d'une proline virale non structurale. 

75 

10. Polypeptide selon Tune quelconque des revendications 1 a 5, 7 ou 8, dans lequel ladite sequence 
contigue provient d'une proline virale structurale. 

11. Polypeptide selon Tune quelconque des revendications 1 a 8, dont la sequence est telle que 
20 representee sur Tune quelconque des figures 1, 3 a 32, 36, 46 et 47, ou dont la sequence est cbdee 

dans un polynucleotide apte a I'hybridation selective ayec le polynucleotide tel que repr^sente* sur Tune 
quelconque des figures 1 , 3 a 32, 36, 46 ou 47. 

12. Polypeptide selon Tune quelconque des revendications 1 a 11, caracteVise* en ce que le polypeptide est 
25 fixe sur une phase solide. 

13. Necessaire d'essai immunologique comprenant un polypeptide selon I'une quelconque des revendica- 
tions 1 a 12, dans un contenant approprie\ 

30 14. Composition comprenant un polypeptide sous forme pratiquement isolSe, selon Tune quelconque des 
revendications 1 a 11, melange" avec un excipient pharmaceutiquement acceptable. 

15. Composition de vaccin selon la revendication 14. 

35 16. Essai immunologique pour la detection d'un anticorps dirige* contre le virus de I'hepatite C (HCV) 
(anticorps anti-HCV), lequel essai immunologique comprend: 

(a) la fourniture d'un polypeptide comprenant un determinant antigenique apte a §tre fixe par ledit 
anticorps anti-HCV, ledit determinant antigenique comprenant une sequence contigue d'aminoacides 
codee par le genome du HCV, le HCV etant caracterise par: 

40 (I) un genome a ARN forme d'une chathe positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au rinoins 
40 % avec la sequence de 859 aminoacides sur la figure 14; 

(b) ('incubation d'un echantillon biologique avec ledit polypeptide, dans des conditions permettant la 
45 formation d'un complexe antigerie-anticorps; et 

(c) la determination de la formation d'un complexe antigene-anti corps comprenant ledit polypeptide. 

17. Essai immunologique selon la revendication 16, dans lequel le polypeptide est prod u it par expression 
d'ADN recombinant. 

50 

18. Essai immunologique selon la revendication 16, dans lequel le polypeptide est prepare par synthese 
chimique. 

19. Essai immunologique selon I'une quelconque des, revendications 16 a i8 t dans lequel ledit polypeptide 
55 est fixe sur un support solide. 

20. Essai immunologique selon Tune quelconque des revendications 16 a 19, dans lequel lesdits com- 
plexes antigene-anticorps sont detected par incubation des complexes avec un anticorps anti-immuno- 
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globuline humaine marque. 

21. Essai immunologique de la revendication 20, dans lequel ledit anti corps anti-immunoglobuline humaine 
est marque par une enzyme. 

5 

22. Essai immunologique selon Tune quelconque des revendications 16 a 21, dans lequel ladite polyprotei- 
ne comprend une sequence d'aminoacides ayant une homologie d'au moins 60 % avec la sequence 
de 859 aminoacides sur la figure 14. 

w 23. Essai immunologique selon Tune queiconque des revendications 16 a 22, dans lequella sequence 
contigue comprend au moins 10 aminoacides. 

24. Essai immunologique selon Tune quelconque des revendications 16 a 23, dans lequel la sequence 
contigue comprend au moins 15 aminoacides. 

75 

25. Essai immunologique selon Tune quelconque des revendications 16 a 24, dans lequel la sequence 
contiguS est donn^e sur la figure 14. 

26. Essai immunologique selon Tune quelconque des revendications 16 a 24, dans lequel la sequence 
20 contigue est donnee sur la figure 47. 

27. Essai immunologique selon Tune quelconque des revendications 16 a 24, dans lequel la sequence 
contigue est telle que representee sur Tune quelconque des figures 1, 3 a 32, 36, 46 ou 47, ou dont la 
sequence est cod6e dans un polynucleotide apte a I'hybridation selective avec le polynucleotide tel 

25 que represent^ sur Tune quelconque des figures 1, 3 a 32, 36, 46 ou 47. 

28. Essai immunologique selon Tune quelconque des revendications 16 a 27, dans lequel ladite sequence 
contiguS est codSe au sein de la banque d'ADNc de \-gt11 dSposSe a \* American Type Culture 
Collection (ATCC) sous le n° 40394. 

30 

29. Essai immunologique selon Tune quelconque des revendications 16 a 28, dans lequel ladite sequence 
contigue provient d'une proteine virale non structurale. 

30. Essai immunologique selon Tune quelconque des revendications 16 a 24 ou 26 a 28, dans lequel ladite 
35 sequence contigue provient d'une proline virale structurale. 

31. Polypeptide immobilise pour utilisation dans I'essai immunologique de I'une quelconque des revendica- 
tions 16 a 30, caracterise en ce que le polypeptide comprend un determinant antig^nique apte a etre 
fixe par un anticorps anti-HCV tel que d£fini dans la revendication 16.. 

40 

32. Polynucleotide sous forme pratiquement isoiee, comprenant une sequence contigue de nucleotides qui 
est apte a Thybridation selective avec le genome du virus de rhepatite C (HCV) ou son complement, le 
HCV etant caracterise par 

(I) un genome a ARN forme d'une chaTne positive; 
45 (II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14. 

33. Polynucleotide selon la revendication 32, dans lequel ladite polyproteine comprend une sequence 
so d'aminoacides ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la 

figure 14. 

34. Polynucleotide selon la revendication 32 ou 33, dans lequel ladite sequence contigue comporte au 
moins 10 nucleotides. 


55 


35. Polynucleotide selon la revendication 34, dans lequel ladite sequence contigue comporte au moins 15 
nucleotides. 
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36. Polynucleotide selon la revendication 35, dans lequel; ladite sequence contigu§ comporte au moins 20 i 
nucleotides. 

37. Polynucleotide selon I'une quelconque des revendications 32 a 36, qui est un polynucleotide d'ADN. 

5 

38. Polynucleotide selon Tune quelconque des revendications 32 a 36, qui est un polynucleotide d'ARN. 

39. Polynucleotide selon Tune quelconque des revendications 32 a 38, fixe sur une phase solide. 

io 40. Sonde comprenant un polynucleotide selon Tune quelconque des revendications 32 a 39 et compre- 
nant en outre un marqueur detectable. 

41. Necessaire d'essai comprenant une sonde polynucleotide selon Tune quelconque des revendications 
32 a 40, dans un contenant approprie. 

15 

42. Necessaire pour la reaction de polymerisation en chaTne (PCR), ; comprenant une paire d'amorces 
capables d'amorcer la synthase d'ADNc dans une reaction PCR, chacune desdites amorces etant un 
polynucleotide selon Tune quelconque des revendications 32 a 37. 

20 43; Necessaire pour PCR selon la revendication 42, comprenant en outre une sonde polynucieotidique apte 
a I'hybridation selective avec une region du genome du HCV situee entre et ne comprenant pas les 
sequences du HCV desquelles sont derivees les amorces. : 

44. Procede d'execution d'une reaction de polymerisation en chaTne, dans lequel les amorces sont une ; 
25 paire de polynucleotides selon Tune quelconque des revendications 32 a 37. 

45. Procede pour I'essai d'un echantillon pour la determination de la presence ou def absence de 
polynucleotides de HCV, comprenant: 

(a) la mise en contact de rechantillon avec une sonde comprenant un polynucleotide selon I'une ; 
30 quelconque des revendications 32 a 40, dans des conditions permettant I'hybridation selective de 

ladite sonde avec un polynucleotide de HCV ou son complement dans rechantillon; et 

(b) la determination de la formation de duplex polynucieotidiques comprenant ladite sonde. 

46. Polynucleotide d'ADN codant pour un polypeptide, lequel polypeptide comprend une sequence 
35 contiguS d'au moins 10 aminoacides codee par le genome du virus de I'hepatite C (HCV) et 

comprenant un determinant antigenique, le HCV etaht caracterise par 

(I) un genome a ARN forme d'une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homqlogie d'au moins 
40 40 % avec la sequence de 859 aminoacides sur la figure 14. . 

47. Nucleotide d'ADN selon la revendication 46, dans lequel ladite polyproteine comprend une sequence 
d'aminoacides ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la 

• figure 14. 

45 .. • • . . ■; . ' : ; • ■ . .;• ••• 

4a Polynucleotide d'ADN selon la revendication 46 ou 47, dans lequel ladite sequence contigue code pour 
au moins 15 aminoacides. 

49. Polynucleotide d'ADN selon I'une quelconque des revendications 46 a 48, dans lequel ladite sequence 
so contigue est donnee sur la figure 14. 

50. Polynucleotide d'ADN selon I'une quelconque des revendications 46 a 48, dans lequel ladite sequence 
contigu^ est donnee sur la figure 47. 

55 51. Polynucleotide d'ADN selon I'une quelconque des revendications 46 a 48, dans lequel ladite sequence 
contigue est codee au sein de la banque d'ADNc de \-gt11 deposee a V American type Culture 
Collection (ATCC) sous le n° 40394. 


102 


EP 0 318 216 B1 


52. Polynucleotide d'ADN selon Tune quelconque des revendications 46 a 48, dont ta sequence est telle 
que representee sur Tune quelconque des figures 1 , 3 & 32, 36, 46 ou 47, ou dont la sequence est apte 
a Thybridation selective avec le polynucleotide tel que represents sur Tune quelconque des figures 1 , 3 
a 32, 36, 46 ou 47. 

5 

53. Polynucleotide d'ADN selon I'une quelconque des revendications 46 a 52, dans lequel ladite sequence 
contigue provient d f une proline virale non structurale. 

54. Polynucleotide d'ADN selon Tune quelconque des revendications 46 h 48 ou 50 & 52, dans lequel 
70 ladite sequence contigue provient d'une proteine virale structurale, 

55. Vecteur recombinant comprenant une sequence codante qui comprend un polynucleotide d'ADN selon 
I'une quelconque des revendications 46 a 54. 

75 56. Cellule note transformed par un vecteur recombinant selon la revendication 55, dans lequel la 
sequence codante est fonctionnellement li^e h une sequence regulatrice capable de produire ['expres- 
sion de la sequence codante par la cellule hote. 

57. Procede de production d'un polypeptide recombinant de HCV, comprenant I'incubation d'une cellule 
20 h6te selon la revendication 56, dans des conditions produisant I'expression de la sequence codante. 

58. Composition d'anticorps anti-HCV comprenant des anticorps qui se lient audit determinant antigenique 
d'un polypeptide selon I'une quelconque des revendications 1 h 12, qui est (a) une composition purine 
d'anticorps polyclonaux, ou (b) une composition d'anticorps monoclonaux. 

59. Composition selon la revendication 58, dans laquelle les anticorps anti-HCV sont fixes sur une phase 
solide. 

60. Necessaire d'essai immunologique comprenant une composition d'anticorps anti-HCV selon la revendi- 
30 cation 58 ou 59, dans un contenant approprie. 

61. Procede d'essai immunologique pour la detection d'un antigene de HCV dans un echantiilon, compre- 
nant: 

(a) la fourniture d'une composition d'anticorps anti-HCV selon la revendication 58 ou 59; 
35 (b) I'incubation d'un echantiilon avec ladite composition d'anticorps anti-HCV, dans des conditions 

permettant la formation d'un complexe antigene-anticorps; et 

(c) la determination de la formation d'un complexe antigene-anticorps comprenant I'anticorps anti- 
HCV. 

40 62. Polypeptide comprenant une sequence contigue d'au moins 10 aminoacides codee par le genome du 
virus de I'hepatite C (HCV) et comprenant un determinant antigenique, dans lequel ladite sequence 
contigu§ est soudee a une sequence d'aminoacides non-HCV, le HCV etant caracterise par 

(I) un genome a ARN forme d'une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

45 (III) ladite polyproteine comprenant une sequence d'aminoacides ay ant une homolbgie d'au moins 

40 % avec la sequence de 859 aminoacides sur la figure 14. 

63. Polypeptide selon la revendication 62, dans lequel ladite polyproteine comprend une sequence 
d'aminoacides ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la 

50 figure 14. 

64. Polypeptide selon la revendication 62 ou 63, dans lequel ladite sequence d'aminoacides non-HCV 
comprend une sequence signal. 

55 65. Polypeptide selon la revendication 62 ou 63, dans lequel ladite sequence d'aminoacides non-HCV 
comprend une sequence d'aminoacides provenant de la jS-galactosidase ou de la superoxyde dismuta- 
se. 
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66. Polypeptide selon la revendication 62 ou 63, dans lequel la sequence d'aminoacides non-HCV 
comprend une proline formant des particuies. 

67. Polypeptide selon la revendication 66, dans lequel la proline formant des particules comprend 
I'antigene de surface de I'hepatite B. 

68. Polypeptide selon Tune quelconque des revendications 1 a 12 ou 62 a 67, pour utilisation dans un 
procede de production d'anticorps anti-HCV, qui comprend Padministration du polypeptide a un 
mammifere, en une quantity suffisante pour produire une reponse immunitaire. 

69. Composition comprenant un polypeptide selon Tune quelconque des revendications 62 a 67, melange* 
avec un excipient pharmaceutiquement acceptable. 

70. Vaccin selon la revendication 69. 

71. Proc^de de culture du virus de I'hepatite C (HCV) comprenant la fourniture de cellules infect^es par le 
HCV et la propagation desdites cellules in vitro, ledit HCV etant caracte*ris6 par 

(I) un genome a ARN forme* d'une chatne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14. 

72. Procede selon la revendication 71, dans lequel ladite polyproteine comprend une sequence d'aminoaci- 
des ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la figure 14. 

73. Procede selon la revendication 71 ou 72, dans lequel lesdites cellules comprenrient des cellules 
primaires. 

74. Procede selon la revendication 71 ou 72, dans lequel lesdites cellules comprenrient une lign§e 
cellulaire. 

75. Procede selon Tune quelconque des revendications 71 a 74, dans lequel lesdites cellules sont des 
hepatocytes ou des macrophages. 

76. Antigene pour I'essai immuhologique du virus de I'hepatite C (HCV) et fixe sur une phase solide, le 
HCV etant caracteVise* par 

(I) un genome a ARN forme d ? une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayarit une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14, 

ledit antigene comprenant un determinant antigenique immunologiquement r^actif avec un anticorps ; 
■ anti-HCV, 

(a) ledit anticorps anti-HCV etant immunologiquement rdactif avec un determinant antigenique de 
reference (I) code par un segment d'insertion d'ADNc de HCV dans la banque de \-gt11 d^pos^e a 
I' American Type Culture Collection (ATCC) sous le n° 40394, ou (II) represent^ sur la figure 47; et 

(b) ledit determinant antigenique de reference etant immunologiquement reactif avec des serums 
provenant de sujets humains infectes par le HCV. 

77. Antigene de HCV selon la revendication 76, dans lequel ledit determinant antigenique de reference est 
represente sur la figure 14. 

Revendications pour I'Etat contractant suivant : ES 

1. Essai immunologique pour la detection d'un anticorps dirige contre le virus de I'hepatite C (HCV) 
(anticorps anti-HCV), lequel essai immunologique comprend: 

(a) la fourniture d'un polypeptide comprenant un determinant antigenique apte a §tre fixe par ledit 
anticorps anti-HCV, ledit determinant antigenique comprenant une sequence contigue d'aminoacides 
codee par le genome du HCV, le HCV etant caracterise par: 
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(I) un genome a ARN forme d'une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine conriprenant une sequence d'aminoacides ayant une hpmologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14; 

5 (b) Tincubation d'un echantillon biologique avec ledit polypeptide, dans des conditions permettant la 

formation d'un complexe antigfene-anticorps; et 

(c) la determination de la formation d'un complexe antig^ne-anticorps comprenant ledit polypeptide. 

2. Essai immunologique selon la revendication 1, dans lequel le polypeptide est produit par expression 
w d'ADN recombinant. 

3. Essai immunologique selon la revendication 1, dans lequel le polypeptide est prepare par synthase 
chimique. 

75 4. Essai immunologique selon Tune quelconque des revendications 1 a 3, dans iequel ledit polypeptide 
est fixe sur un support sol ide. 

5. Essai immunologique selon Tune quelconque des revendications 1 a 4, dans lequel lesdits complexes 
antig&ne-anticorps sont detects par incubation des complexes avec un anticorps anti-immunoglobuline 

20 humaine marque. 

6. Essai immunologique de la revendication 5, dans lequel ledit anticorps anti-immunoglobuline humaine 
est marque par une enzyme. 

25 7. Essai immunologique selon Tune quelconque des revendications 1 h 6, dans lequel ladite poly proteine 
comprend une sequence d'aminoacides ayant une homologie d'au moins 60 % avec la sequence de 
859 aminoacides sur la figure 14. 

8. Essai immunologique selon I'une quelconque des revendications 1 a 7, dans lequel la sequence 
30 contigue comprend au moins 10 aminoacides. 

9. Essai immunologique selon Tune quelconque des revendications 1 a 8, dans lequel la sequence 
contigue comprend au moins 15 aminoacides. 

35 10. Essai immunologique selon Tune quelconque des revendications 1 a 9, dans , lequel la sequence 
contigue est donn£e sur la figure 14. 

11. Essai immunologique selon Tune quelconque des revendications 1 a 9, dans lequel la sequence 
contigue est donn^e sur la figure 47. 

40 

12. Essai immunologique selon Tune quelconque des revendications 1 a 9, dans lequel la sequence 
. contigue est telle que representee sur Tune quelconque des figures 1 , 3 a 32, 36, 46 ou 47, ou dont la 

sequence est cod^e dans un polynucleotide apte a Thybridation selective avec le polynucleotide tef 
que represente sur Tune quelconque des figures 1, 3 a 32, 36, 46 ou 47. 

13. Essai immunologique selon I'une quelconque des revendications 1 a 12, dans lequel ladite sequence 
contiguS est codee au sein de la banque d'ADNc de \-gt11 deposee b V American Type Culture 
Collection (ATCC) sous le n° 40394. 

so 14. Essai immunologique selon I'une quelconque des revendications 1 a 13, dans lequel ladite sequence 
contigue provient d'une proteine virale non structurale. 

15. Essai immunologique selon Tune quelconque des revendications 1 a 9 ou 11 a 13, dans lequel ladite 
sequence contigue provient d'une proteine virale structurale. 

55 

16. Procede d'execution d'une reaction de polymerisation en chaTne, dans lequel les amorces sont une 
paire de polynucleotides, lesquels polynucleotides sont chacun une sequence contigue de nucleotides 
qui est apte a I'hybridation selective avec le genome du virus de I'hepatite C (HCV) ou son 


45 
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complement, le HCV etant caracteVise* par 

(I) un genome a ARN forme d'une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
5 40 % avec la sequence de 859 aminoacides sur la figure 14. 

17. Procede selon la revendication 16, dans lequel ladite polyproteine comprend une sequence d'aminoaci- 
des ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la figure 14. 

w 18. Proc^de selon la revendication 16 ou 17, dans lequel ladite sequence polynucieotidique contigue 
comporte au moins 10 nucleotides. 

19. Precede selon la revendication 18, dans lequel ladite sequence polynucieotidique contigue comporte au 
moins 15 nucleotides. 

75 

20. Procede selon la revendication 19, dans lequel ladite sequence polynucieotidique contigue comporte au 
moins 20 nucleotides. 

21. Pro c^de selon Tune quelconque des revendications 16 a 20, dans lequel le polynucleotide est un 
20 polynucleotide d'ADN. 

22. Procede selon Tune quelconque des revendications 16 a 21, qui comprend en outre ('utilisation d'une 
sonde polynucieotidique apte a Thybridation selective avec une region du genome du HCV situee entre 
et ne comprenant pas les sequences du HCV desquelles sont derivees les amorces. 

25 

23. Procede pour I'essai d'un echantillon pour la determination de la presence ou de I'absence de 
polynucleotides de HCV, comprenant: 

(a) la mise en contact de rechantillon avec une sonde comprenant un polynucleotide tel que detini 
dans Tune quelconque des revendications 16 a 21, dans des conditions permettant I'hybridation 

30 selective de ladite sonde avec un polynucleotide de HCV ou son complement dans rechantillon; et 

(b) la determination de la formation de duplex polynucieotidiques comprenant ladite sonde. 

24. Procede selon la revendication 23, dans lequel la sonde est un polynucleotide d'ARN. 

35 25. Procede selon la revendication 23 ou 24, dans lequel la sonde est fixee sur une phase solide. 

26. Procede selon Tune quelconque des revendications 23 a 25, dans lequel la sonde comprend en outre 
un marqueur detectable. 

40 27. Procede de production d'un polypeptide recombinant de HCV, comprenant I'incubation d'une cellule 
note transformee par un vecteur recombinant comprenant une sequence codante d'un polynucleotide 
d'ADN codant pour un polypeptide, lequel polypeptide comprend une sequence contigue d'au moins 
10 aminoacides codee par le genome du virus de ('hepatite C (HCV) et comprenant un determinant 
antigenique, le HCV etant caracterise par 

45 (I) un genome a ARN forme d'une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14, 

la sequence codante etant fonctionnellement li^e a une sequence regulatrice capable de produire 
so ('expression de la sequence codante par la cellule note, dans des conditions permettant I'expression de 
la sequence codante. 

2a Procede selon la revendication 27, dans lequel ladite polyproteine comprend une sequence d'aminoaci- 
des ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la figure 14. 

55 

29. Procede selon la revendication 27 ou 28, dans lequel ladite sequence contigue* code pour au moins 15 
aminoacides. 
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30. Proc^dd selon I'une quelconque des revendications 27 a 29, dans lequel ladite sequence contigu§ est ■ 
donn^e sur la figure 14. 

31. Proced^ selon I'une quelconque des revendications 27 a 29, dans lequel ladite sequence contigue est 
5 donn£e sur la figure 47. 

32. Proc6d6 selon I'une quelconque des revendications 27 a 29, dans lequel ladite sequence contigue est 
cod^e au sein de la banque d'ADNc de >-gt11 d^pos^e a I" American type Culture Collection (ATCC) 
sous le n° 40394. 

70 

33. Proc6d§ selon I'une quelconque des revendications 27 a 29, dans lequel la sequence polynucieotidique 
d'ADN est telle que representee sur I'une quelconque des figures 1, 3 a 32, 36, 46 ou 47, ou dont la 
sequence est apte a I'hybridation selective avec le polynucleotide tel que reprdsent£ sur Tune 
quelconque des figures 1 , 3 a 32, 36, 46 ou 47. 

75 

34. Procede selon I'une quelconque des revendications 27 a 33, dans lequel ladite sequence contigue 
provient d'une proteine virale non structural. 

35. Procede selon I'une quelconque des revendications 27 a 29 ou 31 a 33, dans lequel ladite sequence 
20 contigue provient d'une proline virale structural. 

36. Procede comprenant le melange d'un polypeptide sous forme pratiquement isoiee, produit par le 
procede de I'une quelconque des revendications 26 a 35, ou d'un polypeptide de meme sequence 
prepare par synthase chimique, avec un excipient pharmaceutiquement acceptable. ■; 

25 

37. Procede selon la revendication 36, pour la preparation d'un vaccin. 

38. Procede selon la revendication 27 ou 28, dans lequel ladite sequence contigue est soud£e a une 
sequence d'aminoacides non-HCV. 

30 

39. Procede selon la revendication 38, dans lequel ladite sequence d'aminoacides non-HCV comprend une 
sequence signal. 

40. Procede selon la revendication 38, dans lequel ladite sequence d'aminoacides non-HCV comprend une 
35 sequence d'aminoacides provenant de la £-galactosidase ou de la superoxyde dismutase. 

41. Procede selon fa revendication 38, dans lequel la sequence d'aminoacides non-HCV comprend une 
proline formant des particules. 

40 42. Proc^de selon la revendication 41, dans lequel la proline formant des particules comprend Vantigfene 
de surface de I'h^patite B. 

43. Procede comprenant le melange d'un polypeptide produit par le procede selon i'une quelconque des 
revendications 38 a 42, avec un excipient pharmaceutiquement acceptable. 

45 

44. Procede selon la revendication 43. pour la preparation d'un vaccin. 

45. Procede d'essai immunologique pour la detection d'un antig&ne de HCV dans un echantillon, compre- 
nant: 

so (a) la fourniture d'une composition d'anticorps anti-HCV comprenant des anticorps qui fixent ledit 

determinant antig^nique d'un polypeptide tel que d£fini dans I'une quelconque des revendications 1 
a 4 ou 7 a 1 5, qui est 

(a) une composition purine d'anticorps polyclonaux, ou 
<b) une composition d'anticorps monoclonaux; 
55 (b) I'incubation d'un echantillon avec ladite composition d'anticorps anti-HCV, dans des conditions 

permettant la formation d'un complexe antig&ne-anticorps; et 

(c) la determination de la formation d'un complexe antigene-anticorps comprenant I'anticorps anti- 
HCV. 
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46. Proc£d§ selon la revendication 45, dans lequel les anticorps anti-HCV sont fixes sur une phase solide. 

47. Procdde* de culture du virus de I'hgpatite C (HCV) comprenant la fourniture de cellules infectSes par te 
HCV et la propagation desdites cellules in vitro, ledit HCV etant caracteVise* par 

(I) un genome h ARN forme* d'une chatne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14. 

48. Proced^ selon la revendication 47, dans lequel ladite polyproteine comprend une sequence d'aminoaci- 
des ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la figure 14. 

49. Proc6d6 selon la revendication 47 ou 48, dans lequel lesdites cellules comprennent des cellules 
primaires. 

50. Proc^de* selon la revendication 47 ou 48, dans lequel lesdites cellules comprennent une lign^e 
cellulaire. 

51. Proc6d6 selon Tune quelconque des revendications 47 h 50, dans lequel lesdites cellules sont des 
h^patocytes ou des macrophages. 

52. Proc£d6 de production d'un antigene pour I'essai immunologique du virus de ('hepatite C (HCV), le 
HCV etant caracterise par 

(I) un genome a ARN forme* d'une chatne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14, 

ledit antigene comprenant un determinant antigenique immunologiquement r£actif avec un anticorps 
anti-HCV, 

(a) ledit anticorps anti-HCV etant immunologiquement rSactif avec un determinant antigenique de 
reference (I) code par un segment d'insertion d'ADNc de HCV dans la banque de X-gt1 1 d£pos£e k 
V American Type Culture Collection (ATCC) sous le n° 40394, ou (II) represent^ sur la figure 47; et 

(b) ledit determinant antigenique de reference etant immunologiquement reactif avec des serums 
provenant de sujets humains infectes par le HCV; 

qui comprend la fixation h une phase solide dudit antigene pour I'essai immunologique. 

53. Procede selon la revendication 52, dans lequel ledit determinant antigenique de reference est 
represente sur la figure 14. 

Revendications pour I'Etat contractant sulvant : GR 

1. Essai immunologique pour la detection d'un anticorps dirige contre le virus de ('hepatite C (HCV) 
(anticorps anti-HCV), lequel essai immunologique comprend: 

(a) la fourniture d'un polypeptide comprenant un determinant antigenique apte h §tre fixe par ledit 
anticorps antiHCV, ledit determinant antigenique comprenant une sequence contigue d'aminoacides 
codee par le genome du HCV, le HCV etant caracterise par: 

(I) un genome k ARN forme d'une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14; 

(b) I'incubation d'un echantillon biologique avec ledit polypeptide, dans des conditions permettant la 
formation d'un complexe antigene-anticorps; et 

(c) la determination de la formation d'un complexe antigene-anticorps comprenant ledit polypeptide. 

2. Essai immunologique selon la revendication 1, dans lequel le polypeptide est produit par expression 
d'ADN recombinant. 
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3. Essai immunologique selon la revendication 1, dans lequelle polypeptide est prepare* par synthese 
chimique. 

4. Essai immunologique selon Tune quelconque das revendications 1 a 3, dans lequel (edit polypeptide 
5 est fixe* sur un support solide. 

5. Essai immunologique selon Tune quelconque des revendications 1 a 4, dans lequel lesdits complexes 
antigene-anticorps sont detectes par incubation des complexes avec un anticorps anti-immunoglobuline 
humaine marque. 

w 

6. Essai immunologique de la revendication 5, dans lequel ledit anticorps anti-immunoglobuline humaine 
est marque* par une enzyme. 

7. Essai immunologique selon Tune quelconque des revendications 1 a 6, dans lequel ladite polyproteine 
75 comprend une sequence d'aminoacides ayant une homologie d'au moins 60 % avec la sequence de 

859 aminoacides sur la figure 14. 

8. Essai immunologique selon Tune quelconque des revendications 1 a 7, dans lequel la sequence 
contigue comprend au moins 10 aminoacides. 

20 

9. Essai immunologique selon Tune quelconque des revendications 1 a 8, dans lequel la sequence 
contigue* comprend au moins 15 aminoacides. 

10. Essai immunologique selon Tune quelconque des revendications 1 a 9, dans lequel la sequence 
25 contigue est donn^e sur la figure 14. 

11. Essai immunologique selon Tune quelconque des revendications 1 a 9, dans lequel la sequence 
contigue" est donn^e sur la figure 47. 

30 12. Essai immunologique selon Tune quelconque des revendications 1 a 9, dans lequel la sequence 
contigue est telle que representee sur Tune quelconque des figures 1 , 3 a 32, 36, 46 ou 47, ou dont la 
sequence est codee dans un polynucleotide apte a ('hybridation selective avec le polynucleotide tel 
que represents sur Tune quelconque des figures 1,3a 32, 36, 46 ou 47. 

35 13. Essai immunologique selon Tune quelconque des revendications 1 a 12, dans lequel ladite sequence 
contigue* est cod§e au sein de la banque d'ADNc de \-gt11 d^posee a V American Type Culture 
Collection (ATCC) sous le n° 40394. 

14. Essai immunologique selon Tune quelconque des revendications 1 a 13, dans lequel ladite sequence 
ao contigue provient d'une proline virale non structural. 

15. Essai immunologique selon Tune quelconque des revendications 1 a 9 ou 11 a 13, dans lequel ladite 
sequence contigue provient d'une proteine virale structurale. 

45 16. Procede* d'execution d'une reaction de polymerisation en chaTne, dans lequel les amorces sont une 
paire de polynucleotides, lesquels polynucleotides sont chacun une sequence contigue* de nucleotides 
qui est apte a Thybridation selective avec le genome du virus de ('hepatite C (HCV) ou son 
complement, le HCV etant caracterise par 

(I) un genome a ARN forme d'une chaTne positive; 
so (II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14. 

17. Procede selon la revendication 16, dans lequel ladite polyproteine comprend une sequence d'aminoaci- 
55 des ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la figure 14. 

18. Procede selon la revendication 16 ou 17, dans lequel ladite sequence polynucleotidique contigue 
comporte au moins 10 nucleotides. 
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19. Procede selon la revendication 18, dans lequel ladite sequence polynucieotidique contigug comporte au 
moins 15 nucleotides. 

20. Procede selon la revendication 19, dans lequel ladite sequence polynucieotidique contigue comporte au 
5 moins 20 nucleotides. 

21. Procede selon rune quelconque des revendications 16 a 20, dans lequel le polynucleotide est un 
polynucleotide d'ADN. 

io 22. Procede selon Tune quelconque des revendications 16 a 21, qui comprend en outre Tutilisation d'une 
sonde polynucieotidique apte a ('hybridation selective avec une region du genome du HCV situee entre 
et ne comprenant pas les sequences du HCV desquelles sont d^rivees les amorces. 

Procede pour I'essai d'un echantillon pour la determination de la presence ou de I'absence de 
polynucleotides de HCV, comprenant: 

(a) la mise en contact de rechantillon avec une sonde comprenant un polynucleotide tel que defini 
dans I'une quelconque des revendications 16 a 21 1 dans des conditions permettant ('hybridation 
selective de ladite sonde avec un polynucleotide de HCV ou son complement dans rechantillon; et 

(b) la determination de la formation de duplex polynucieotidiques comprenant ladite sonde. 

Procede selon la revendication 23, dans lequel la sonde est un polynucleotide d'ARN. 

25. Procede selon la revendication 23 ou 24, dans lequel la sonde est fixee sur une phase solide. 

25 26. Procede selon I'une quelconque des revendications 23 a 25, dans lequel la sonde comprend en outre 
un marqueur detectable. 

27. Procede de production d'un polypeptide recombinant de HCV, comprenant I'incubation d'une cellu|e 
note transformee par un yecteur recombinant comprenant une sequence codante d'un poly nucleotide 
30 d'ADN codant pour un polypeptide, lequel polypeptide comprend une sequence contigue d'au moins 
10 aminoacides codee par le genome du virus de Thepatite C (HCV) et comprenant un determinant • 
antigenique, le HCV etant caracterise par 

(I) un genome a ARN forme d'une chalhe positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

35 (III) ladite polyproteine comprenant une sequence d 'aminoacides ay ant une homologie d'au. moins 

40 % avec la sequence de 859 aminoacides sur la figure 14, 
la sequence codante etant fonctionnellement liee a une sequence regulatrice capable de produire 
('expression de la sequence codante par la cellule hote, dans des conditions permettant Pexpression de 
la sequence codante. 

40 . : • 

2& Procede selon la revendication 27, dans lequel ladite polyproteine comprend une sequence d'amihoaci- 
des ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la figure 14. 

29. Procede selon la revendication 27 ou 28, dans lequel ladite sequence contigUe code pour au moins 1 5 
45 aminoacides. 

30. Procede selon Tune quelconque des revendications 27 a 29, dans lequel ladite sequence contigUe est 
donnee sur la figure 14. 

so 31. Procede selon Tune quelconque des revendications 27 a 29, dans lequel ladite sequence contigue est 
donnee sur la figure 47. 

32. Procede selon Tune quelconque des revendications 27 k 29, dans lequel ladite sequence contigue est 
codee au sein de la banque d'ADNc de x-gt1 1 deposee a V American type Culture Collection (ATCC) 

55 sous le n° 40394. 

33. Procede selon I'une quelconque des revendications 27 a 29, dans lequel la sequence polynucieotidique : 
d'ADN est telle que representee sur I'une quelconque des figures 1,3a 32, 36, 46 ou 47, ou dont la 
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sequence est apte a Thybridation selective avec le polynucleotide tel que repr6sent6 sur I'une 
quelconque des figures 1,3 a 32, 36, 46 ou 47. 

34. Procede selon Tune quelconque des revendications 27 a 33, dans lequel ladite sequence contigue 
s provient d'une proline vi rale nori structu rale. 

35. Procede selon Tune quelconque des revendications 27 a 29 ou 31 a 33, dans lequel ladite sequence 
contigue provient d'une proline vi rale structu rale. 

w 36. Procede comprenant le melange d'un polypeptide sous forme pratiquement isoiee, produit par le 
procede de Tune quelconque des revendications 26 a 35, ou d'un polypeptide de meme sequence 
prepare par synthase chimique, avec un excipient pharmaceutiquement acceptable. ; 


75 


37. Proc6de selon la revendication 36, pour la preparation d'un vaccin. 

38. Procede selon la revendication 27 ou 28, dans lequel ladite sequence contigue* est soud^e a une 
sequence d'aminoacides non-HCV. 

39. Procede* selon la revendication 38, dans lequel ladite sequence d'aminoacides non-HCV comprend une 
20 sequence signal. 

40. ProcSde* selon la revendication 38, dans lequel ladite sequence d'aminoacides non-HCV comprend une 
sequence d'aminoacides provenant de la 0-galactosidase ou de la superoxyde dismutase. 

25 41. Procede* selon la revendication 38, dans lequel la sequence d'aminoacides non-HCV comprend une 
proline formant des particules. 

42. Procede selon la revendication 41, dans lequel la proline formant des particules comprend I'antigene 
de surface de I'hepatite B. 


30 


43. Procede comprenant le melange d'un polypeptide produit par le procede selon I'une quelconque des 
revendications 38 a 42, avec un excipient pharmaceutiquement acceptable. 

44. Procede selon la revendication 43 pour la preparation d'un vaccin. 


35 


45. Procede d'essai immunologique pour la detection d'un antigene de HCV dans un echantillon, compre- 
nant: 

(a) la fourniture d'une composition d'anticorps anti-HCV comprenant des anticorps qui fixent ledit 
determinant antigenique d'un polypeptide tel que defini dans I'une quelconque des revendications 1 
40 a 4 ou 7 a 15, qui est 

(a) une composition purifies d'anticorps polyclonaux, ou 

(b) une composition d'anticorps monoclonaux; 

(bj Tiricubation d'un echantillon avec ladite composition d'anticorps anti-HCV, dans des conditions 
permettant la formation d'un complexe antigene-anticorps; et 
45 (c) la determination de la formation d'un complexe antigene-anticorps comprenant I'anticorps anti- 

" HCV. ■ 

46. Procede. selon la revendication 45, dans lequel les anticorps anti-HCV sont fixes sur une phase solide. 

so 47. Procede de culture du virus de ('hepatite C (HCV) comprenant la fourniture de cellules infectees par te 
HCV et la propagation desdites cellules in vitro, ledit HCV etant caracterise par 

(I) un genome a ARN forme d'une chatne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
55 40 % avec la sequence de 859 aminoacides sur la figure 14. 

48. Procede selon la revendication 47, dans lequel ladite polyproteine comprend une sequence d'aminoaci- 
des ayant une homologie d'au moins 60 % avec la sequence de 859 aminoacides sur la figure 14. 
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49. Proc6d6 selon ta revendication 47 ou 48, dans lequel lesdites cellules comprennent des cellules 
primaires. 

50. ProcSde* selon la revendication 47 ou 48, dans lequel lesdites cellules comprennent une lign§e 
5 cellulaire. 

51. Proce*de* selon Tune quelconque des revendications 47 a 50, dans lequel lesdites cellules sont des 
hepatocytes ou des macrophages. 

10 52. Procede de production d'un antigene pour I'essai immunologique du virus de I'hepatite C (HCV), le 
HCV etant caracterise par 

(I) un genome a ARN forme d'une chatne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
75 40 % avec la sequence de 859 aminoacides sur la figure 14, 

ledit antigene comprenant un determinant antigenique immunologiquement reactif avec un anticorps 
anti-HCV, 

(a) ledit anticorps anti-HCV etant immunologiquement reactif avec un determinant antigenique de 
reference (I) code par un segment d'insertion d'ADNc de HCV dans la banque de \-gt1 1 dgposSe a 

20 I' American Type Culture Collection (ATCC) sous le n° 40394, ou (II) represents sur la figure 47; et 

(b) ledit determinant antigenique de reference etant immunologiquement reactif avec des serums 
provenant de sujets hu mains infectes par le HCV; 

qui comprend la fixation a une phase solide dudit antigene pour I'essai immunologique. 

25 53. Procede selon la revendication 52, dans lequel ledit determinant antigenique de reference est 
represente sur la figure 14. 

54. Polypeptide immobilise pour utilisation dans Vessai immonologique de Tune quelconque des revendica- 
tions 1 a 15, caracterise en ce que le polypeptide comprend un determinant antigenique apte a etre 

30 fixe par un anticorps anti-HCV tel que defini dans la revendication 1 . 

55. Polypeptide sous forme pratiquement isoiee, comprenant une sequence contigue d'au moins 10 
aminoacides codee par le genome du virus de I'hepatite C (HCV) et comprenant un determinant 
antigenique, le HCV etant caracterise par: 

35 (I) un genome a ARN forme d'une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14. 

40 56. Polypeptide selon la revendication 55, qui est tel que defini dans I'une quelconque des revendications 
2a 4ou7a 15. 

Polynucleotide sous forme pratiquement isoiee, comprenant une sequence contigue de nucleotides, qui 
est apte a Thybridation selective avec le genome du virus de I'hepatite C (HCV) ou son complement, le 
HCV etant caracterise par 

(I) un genome a ARN forme d'une chaTne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14. 

Polynucleotide selon la revendication 57, qui est tel que defini dans I'une quelconque des revendica- 
tions 17 a 21 ou 24 a 26. 

59. Necessaire d'essai comprenant une sonde polynucleotide selon la revendication 58 dans un contenant 
55 approprie. 

60. Necessaire pour la reaction de polymerisation en chaTne (PCR), comprenant une paire d'amorces 
capables d'amorcer la synthase d'ADNc dans une reaction PCR, chacune desdites amorces etant un 


57. 

45 
50 

58. 
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polynucleotide selon I'une quelconque des reveridications 16 h 21. 

61. Necessaire pour PGR selon la revendication 60, comprenant en outre une sonde polynucleotidique apte 
& I'hybridation selective avec une region du genome du HCV situe'e entre et ne comprenant pas les 

5 sequences du HCV desquelles sont de'rive'es les amorces. 

62. Polynucleotide d'ADN codant pour un polypeptide, lequel polypeptide comprend une sequence 
contigue d'au moins 10 aminoacides code*e par le genome du virus de I'hepatite C (HCV) et : 
comprenant un determinant antig^nique, le HCV 4tant caract^rise par 

w (I) un genome k ARN forme d'une chatne positive; 

(II) (edit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et , 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14. 

75 63. Polynucleotide d'ADN selon la revendication 62, qui est tel que defini dans Tune quelconque des : 
revendications 28 k 35. 

Vecteur recombinant comprenant une sequence codante qui comprend un polynucleotide d'ADN selon 
!a revendication 62 ou 63. 

Cellu|e note transformee par un vecteur recombinant selon la revendication 64, dans lequel la 
sequence codante est fonctionnellement liee & une sequence regulatrice capable de produire ('expres- 
sion de la sequence codante par la cellule hote. 

25 66. Composition d'anticorps anti-HCV comprenant des anticorps qui se lient audit determinant antigenique 
d'un polypeptide selon la revendication 55 ou 56, qui est (a) une composition purifiee d'anticorps 
polyclonaux, ou (b) une composition d'anticorps monoclonaux, 

67. Composition selon la revendication 66, dans laquelle les anticorps. anti-HCV sont fixes sur une phase 
30 solide. 

68. Necessaire d'essai immunologique comprenant une composition d'anticorps anti-HCV selon la revendi- ; 
cation 66 ou 67, dans un contenant approprie. 

35 69. Polypeptide comprenant une sequence contigue d'au moins 10 aminoacides codee par le genome du 
virus de I'hepatite C (HCV) et comprenant un determinant antigenique, dans lequel ladite sequence 
contigue est soudee h une sequence d'aminoacides non-HCV, le HCV etant caracterise par 

(I) un genome ^ ARN forme d'une chatne positive; 

(II) ledit genome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

40 (III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 

40 % avec la sequence de 859 aminoacides sur la figure 14. 

Polypeptide selon la revendication 69, qui est tel que defini dans Tune quelconque des revendications 
39 k 42. 

Antigene pour Tessai immunologique du virus de I'hepatite C (HCV) et fixe sur une phase solide, le 
HCV etant caracterise par 

(I) un genome & ARN forme d'une chatne positive; 

(II) ledit gdnome comprenant un cadre de lecture ouvert (CLO) codant pour une polyproteine; et 

(III) ladite polyproteine comprenant une sequence d'aminoacides ayant une homologie d'au moins 
40 % avec la sequence de 859 aminoacides sur la figure 14, 

ledit antigene comprenant un determinant antigenique immunqlogiquement reactif avec un anticorps 
anti-HCV, 

(a) ledit anticorps anti-HCV etant immunologiquement reactif avec un determinant antigenique de 
reference (I) code par un segment d'insertion d'ADNc de HCV dans la banque de X-gt11 deposeVa 
r American Type Culture Collection (ATCC) sous le n° 40394, ou (II) represente sur la figure 47; et 

(b) ledit determinant antigenique de reference etant immunologiquement reactif avec des serums 
provenant de sujets hu mains infectes par le HCV. 
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72. AntigSne de HCV selon la revendication 71 , dans lequel ledit determinant antig^nique de r6f£rence est 
represents* sur la figure 14. 
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FIG. I Translation of DNA 5-1-1 

AlaSerCysLeuAsnCysSerAlaSerllelleProAspArgGluValLeuTyrArgGlu 
1 GGCCTCCTGCTTGAACTGCTCGGCGAGCATCATACCTGACAGGGAAGTCCTCTACCGAGA 
CCGGAGGACGAACTTGACGAGCCGCTCGTAGTATGGACTGTCCCTTCAGGAGATGGCTCT 

PheAspGluMetGluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMet^etLeu 
61 GTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCT 
CAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGA 

AlaGluGlnPheLysGlnLysAlaLeuGlyLeu 
121 CGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCC 
GCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGG 


FIG. 3 Translation of DNA 5-1-1, 81, 91&1-2 

GlyCysValVallleValGlyArgValValLeuSerGlyLysProAlallelleProAsp 
1 CTGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTG 
GACCGACGCACCAG TATCACCCGTCCCAGC AGAACAGGCCCTTCGG CCGTTAGTATGG AC 

T 

ArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLeuProTyr 
6 1 ACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGT 
TGTCCC TTCAGGAG ATGGCTCTCAAGCTAC TCTACCTTCTCACG AG AGTCGTGAATGGCA 
A 

IleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGlyLeuLeuGln 
121 ACATCGAGC^GGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGC 
TGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACG 

ThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLysLeu 
181 AG ACCG CGTCCCGTCAGGCAGAGGTTATCGCCCCTGC TGTCCAGACCAACTGGC AAAAAC 
TCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTG 

GluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAlaGly 
241 TCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGGCGG 
AGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACCGCG 

LeuSerThrLeuProGlyAsnProAlalleAlaSexLeuMetAlaPheThrAlaAlaVal 
301 GCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACAGGTGCTG 
CGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGACGAC 

ThrSerProLeuThrThrSerGln 
361 TCACCAGCCCACTAACCACTAGCCAAA 
AGTGGTCGGGTGATTGGTGATCGGTTT 
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FIG. 4 Translation of DNA 81 

SerGlvLysProAlallelieProAspArgGluValLeuTyrArgGluPheAspGli^et 
1 GTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGAT 
CAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTA 

GluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPhe 
61 GGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTT 
CCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTC^ 

LysGlnLysAlaLeuGlyl^uI^uGlnT^ 
121 CAAGCAGAAGGCCCTGGGCCTCCTGCAGACCGCGTGGCGTGAGGCAGAC^TTATCGCCCC 
GTTCGTCTTCCGGGAGCCGGAGGA.CGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGG 

AlaValGlnThrAsnTrpGlnLysL^uG^ 
181 TGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTT 

ACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAA 

IleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAla 
241 CATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGC 
GTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACG 


301 


SerLeuMetAlaPheThrAlaAlaValThrSer ProLeuThrThrSerGln 
TTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAA 
AAGTAACTACCGAAAATGTCGACGACAGTCGTCGGGTGAT^ 


61 


121 


181 


241 


FIG. 5 Translation of DNA 36 

AspAlaHisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAla 
1 GATGCCCACiTTCTATCCCAGAGAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGC 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrp 
TACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGG 
ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCGTGGTGTACACC 

LysCysI^uIleArgLeuLysProThrl^uHisGlyProThrProLeuLeuTyrArgLeu 
AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTG 
TTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGAC 

GlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCys 
GGCGCTGTTGAGAATGAAATCACCCTGACGGAGCCAGTCACCAAATACATCATGACATGC 
CGGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACG 

MetSerAlaAspI^uGiuVaiVal^ 

ATGTCGGCCGACCTGGAGGTCGTCACGAGCAGCTGGGTGCTCGTTCGCGGCGTCCTGGCT 
TACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGA 


AlaLeuAlaAlaTyrCysI^uSerThrGlyCysValVailleValGlyArgValValLeu; 
301 GCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTG 
CGAAACCGGCGGATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC 

Overlap with 81 

SerGlyLysProAlallelleProAspArgGluValLeuTyrArg 
361 TCCGGG AAGCCGGCAATCATACCTG ACAGGGAAGTCCTCTACCG AG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTC 
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FIG. 6 Combined ORF of DMAs 36 & 81 


AspAlaHisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAla 
1 GATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTC 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrp 
61 TACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGG 
ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACC 

LysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeu 
121 AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCC AACACCCCTGCTATAC AGACTG 
TTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGAC 

GlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCys 
181 GGCGCTGTTCAG AATG AAATCACCCTG ACGCACCCAGTCACCAAAT ACATCATG ACATGC 
CCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGG TCAGTGGTTTATGTAGTAC TGTACG 

MetSerAlaAspI^uGluValValThrSerThrTrpValLeuValGlyGlyValLeuAla 
241 ATGTCGGCCGACCTGG AGGTCGTCACG AGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCT 
TACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGA 

AlaLeuAlaMaTyrCysLeuSerThrGlyCysValVallleValGlyArgValValLeu 
301 GCTTTGGCCGCGTATTGCCTGTGAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTG 
CGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC 

SerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluKet 
361 TCCGGG AAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAG AGTTCGATGAGATG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTAC 

GluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPhe 
421 G AAGAGTGCTCTCAGCACTTACCGTACATCG AGCAAGGG ATGATGCTCGCCGAGCAGTTC 
CTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAG 

LysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaPro 
481 AAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCT 
TTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGG^ 

AlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhe 
541 GCTGTCCAGACCAACTGGCAAAAACTCGAGACCWCTGGGCGAAGCATATGTGGAACTTC 
CGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACA^ 

IleSerGlylleGlnTyrl^uAlaGlyLeuSerThrl^uPrcGlyAsnProAlalleM 
601 ATCAGTGGGATACAATACTTGGCGGGCrrGTCMCGCTGCCT^ 

TAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGA 

SerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
661 TCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAA^ 
AGTAACTACCX5AAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCOT 
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FIG. 7 Translation of DNA 32 


Overlap with 81— — — ■ 

PheThrAlaAlaValThrSerProI^uThrThrSerGlnThrLeuLeuPheAsnlleLeu 

1 CTTTTACAGCTCCTGTCACCAGCCCACTAACCACTAGC 

GAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATA 

GlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAla 
61 TGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCGTTTGTGGGCG 
ACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGC 

GlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeu 
121 CTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCC 
GACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGG 

AlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSerGlyGlu 
181 TTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTGTTGTGGGATTCAAGATCATGAGCGGTG 
AACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCAC 

ValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeu 
241 AGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCC 
TCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGG 

ValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAla, 
301 TCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGG 
AGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCC 

ValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
361 CAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCC CGGGGG AACCATGTTTCCCC . 
GTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 


119 


EP 0 318 216 B1 


FIG. 8 Translation of DNA 35 

SerlleGluThrlleThrl^uProGliiAspjaaValSerArgThrGln^^gGly^g 
1 TC^TTCMACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGC^G 

AGOTAACTCT^AGTCCGAGGGGGTCCTACGACAGAGOT 

ThrGlvArqGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSerGly 
ci I^rrAGGGGGAAGCCA^ 

tSgctca^ 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetJ^nThxProGlyl^uProVal 
ISS2gccgagacta^ 

rv.srinAsDHislAuGluPheTrpGluGlyValPheThrGlylAuThrHislleAspAla 

241 Sc^gIcCATCTTCAA^ 

A?GG^CTGGTAGAACOTAAAACCCTCCCGCAGAAATC 

HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuVamaTyrGln 
301 XSSSmSo^ 

GTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATXMACCAT 


Overlap with 36 — ' ~ ~ 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 

361 GCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGAXGTGGAAGTGT 

cgg^cacaScgatcccgagttc<k^agggggtagcaccctggtctaca 

^uIleArgLeuLysProThrl^^^ 

421 ttgattcgcctcaagcccaccc^ 

aactaagcggagttcgggt^gaggtacccggttgtggggacgatatgtctc^^ 
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P|Q # 9-| Combined ORP of DNAs 35,36,81 & 32 

SerlleGluThrlleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArg 
1 TCCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGG 
AGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCC 

ThrGlyArgGlyLysProGlylleTYrArgPheValAlaProGlyGluArgProSertly 

61 ACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGG 

TGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCG 

MetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCysAla^pTyrGluLeu 
121 ATCTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGA^ 

TACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTGCGACACGAACCATACTCGAG 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
181 ACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTCCCC 
TGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTC^ 

CysGlnAspHisI^uGluPheTrpGluGlyValPheThiGlyUsuThrHisIleAspAla 
241 TGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCC 
ACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGG 

HisPheLeuSerGlnThrLysGlnSerGlyGluAsnI«uProTyrI«uValAlaTyrGln 
301 CACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGGGTACCAA 

GTCAAAGATAGGGTCTGTT^ 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMet^pLysCys 
361 GCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATG^AAGTGT 
CGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACA 

LeulleArgl^uLysProThrLeuHisGlyProThxProLeuLeu^A^^GlyAla 
421 TTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTCGGCGCT 
AACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGA 

ValGlnAsnGluIleThrl^uThrHisProValThrLysTyrlleMetThrCysMetSer 
481 GTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATCCATGTCG 
CAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGC 

AlaAspLeuGluValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeu 
54 1 GCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTC 
CGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAAC 

AlaAlaTyrCysl^uSerThrGlyCysValVallleValGlyAxgValValLei^rGly 
601 GCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTCTCCGGG 
CGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCC 

LysProAlallelleProAspArgGluVallAuTyrAxgGluPheAspGluMetGluGlu 
661 AAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAG 
TTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTC 

CysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLcuAlaGluGlnPheLy^ln 
721 TGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAG 
ACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTC 

LysAlal^uGlyl^uLeuGlnThrMaSerArgGlriAlaGluValll^aPro^aVal 
781 AAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTCTC 
TTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAG 
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GlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSer 
841 CAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGT 
GTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCA 

GlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeu 
901 GGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTG 
CCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAAC 

MetMaPheThrAlaAlaValThrSerProI^uThrThrSerGlnThrl^uLeuPheAsn 
961 ATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACGACTAGCCAAACCCTCCTCTTCAAC 
TACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTG 

IleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheVal 
1021 ATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTG 
TATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACAC 

GlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAsp 
1081 GGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGAC 
CCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTG 

IlelJeuAlaGlyTyrGlyAlaGlyValMaGlyAlaLeuValMaPheLysIleMetSer 
1141 ATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTCTGGCATTCAAGATC 

TAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCG 

GlyGluValProSerThrGluAspLeuValAsnLeuLeuProAlallel^uSerProGly 
1201 GGTGAGGTCCCCTCCACGGAGGACCTGGTCAATGTACTGCCCGCCATCCTCTCGCCCGGA 
CCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCT 

AlaLeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGlu 
1261 GCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAG 
CGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTC 

GlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
1321 GGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
CCCCGTGACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCGCCCTTGGTACAAAGGGG 

FIG. 9-2 
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FIG; I 0 Translation of DNA 37b 


LeuAlaAlaLysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGIyLeuAsp 
1 CTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGAC 
GAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTG 


ValSerVallleProThrSerGlyAspValValValValAlaThrAspAlaLeuMetThr 
61 GTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACC 
CACAGGCAGTAGGGCTGGTCGCCGCTACAAGAGCAGCACCGTTGGCTACGGGAGTACTGG 


GlyTyrThrGlyAspPheAspSerVallleAspTyrAsnThrCysValThrGlnThrVal 
121 GGCTATACCGGCGACTTCGACTCGGTGATAGACTACAATACGTGTGTCACCCAGACAGTC 
CCGATATGGCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAG 

- — — — — — - — —Overlap with 

AspPheSerLeuAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaVal 
181 GATTTCAGCCTTGACCCTACCTTCAGCATTGAGACAATCACGCTCCCCCAGGATGCTGTG 
CTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAG 

■clone 35. — - : — ■ 

SerArgThrGlnArgArgGlyArgThr 
241 TCCCGC ACTCAACGTCGGGGCAGG ACTG 
AGGGCGTGAGTTGCAGCCCCGTCCTGAC 


FIG. II Translation of DNA 33b 


— ; Overlap with 32-^ — -.— — 

MetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProThrHisTyrVal 
1 GATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTGCCCGACGCACTACGT 
CTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCA 

ProGluSerAspAlaAlaAlaArgValThrAlalleLeuSerSerLeuThrValThrGln 
61 GCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACCCA 
CGGCCTCTCGCTACGTCGACGGGCGGAGTGACGGTATGAGTGGTCGGAGTGACATTGGGT 

I^uLeuArgArgLeuHisGlnTrpIleSerSerGluCysThrThrProCysSerGlySer 
121 GCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGTTC 
CGAGGACTCGGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAG 

. TrpLeuArgAspIleTrpAspTrpIleCysGluValLeuSerAspPheLysThrTrpLeu 
181 CTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCT . 
GACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGA 

LysAlaLysLeuMetProGlnLeuProGiylleProPheValSerCysGlnArgGlyTyr 
241 AAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGGTA 
TTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCAT 

LysGlyValTrpArgVal 
301 TAAGGGGGTCTGGCGAGTG 
ATTCCCCCAGACCGCTCAC 
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FIG, I 2 Translation of DNA 40b 

AlaTyrMetSerLysAlaHisGlylleAspProAsnlleArgThrGlyValArgThrlle 
1 GGCTTACATGTCCAAGGCTCATGGGATCGATCCtAACATCAGGACCGGGGTGAGAACAAT 
CCGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 
61 TACCACTGGCAGCGCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTG 
ATGGTG ACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCAC 

SerGlyGlyAlaTyrAspIlellelleCysAspGluCysHisSerThrAspAlaThrSer 
121 CTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATC 
GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAG 

IleLeuGlylleGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuyalVal 
181 CATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGT 
GTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAAGA 

LeuAlaThrAlaThrProProGlySerValThrValProHisProAsnlleGluGluVal 
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGT 
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

AlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlalleProLeuGluVallle 
301 TGCTCTGTCCACCACCGG AG AG ATCCCTTTTTACGGCAAGGCTATCCCCCTCG AAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTA 


LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
CAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGC 
GTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCG 

f Overlap with 37b 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT 
TTTCGACCAGCGTAACCCGTAGTTACGGGACCGGATGATGGCGCCAGAACTGCACAGGCA 


IleProThr 
481 CATCCCGACCAG 
GTAGGGCTGGTC 
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FIG. I 3 Translation of DNA 25c 


CysSerl^uThrValThrGlnl^uLeuArgArglieuHisGlnTrpIleSerSerGluCys 
1 ACTGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGT 
TGACGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCA 


ThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCysGluValLeu 
61 GTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACAXCTGGGACTGGATATGCGAGGTGT 
CATGGTGAGGTACGAGGCCAAGGAGCGATTCCCTGTAGACCCTGACCTATAGGCTCCACA 

— -Overlap with 33b ■ — — 

SerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGlylleProPhe 
121 TGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTCGGATCCCCT 
ACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTGGACGGACCCTAGGGGA 


ValSerCysGlnArgGlyTyrLysGlyValTrpArgGlyAspGlylleMetHisThrArg 
181 TTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGGGGACGGCATCATGCACACTC 
AACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCCCCTGCCGTAGTACGTGTGAG 

CysHisCysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArglleValGly 
241 GCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCG 
CGACGGTGACACCTCGAGTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGC 

ProArgThrGysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGly : 
301 GTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGG 
CAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCC 

ProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGlu 
361 GCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGGTATGG AGGGTGTCTGCAG 
. CGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTC 

GluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMetThrThrAsp 
421 AGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCGACTACGTGACGGGTATGACTACTG 

TCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGAC ... 

AsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
4 81 ACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
TGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 
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FIG. I 4- l Combined ORF of DNAs 4 0b/37b/3 5/36/8 1/3 2/3 3b/25c 

AlaTyrMetSerLysAlaHisGlylleAspProAsnlleArgThrGlyValArgThrlle 
1 TGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAAT 
ACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGly SerProI leThrTyrSerThrTyrGlyLy sPheLeuAlaAspGlyGlyCys 
61 TACCACTCGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTG 
ATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCAC 

SerGlyGlyAlaTyrAspIlellelleCysAspGluCysHisSerThrAspAlaThrSer 
121 CTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATC 
GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCGTACGGTGTAG 

IlelieuGlylleGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValVal 
181 CATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGT 
GTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACA 

LeuAlaThrAlaThrProProGlySerValThrValProHisProAsnlleGluGluVal 
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGT 
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

AlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlalleProLeuGluVallle 
301 TGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTA 

LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
361 CAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGC 
GTTGCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCG 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
421 AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT 
TTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCA 

IleProThrSerGlyAspValValValValAlaThrAspAlal^uMetThrGlyTyrThr 
481 CATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATAC 
GTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATG 

GlyAspPheAspSerVallleAspTyrAsnThrCysValThrGlnThrValAspPheSer. 
54 1 CGGCGACTTCGACTCGGTGATAGACTACAATACGTGTGTCACCCAGACAGTCGATTTCAG 
GCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAGCTAAAGTC 

I^uAspProThrPheThrlleGluThrlleThrLeuProGlriAspAlaValSerArgThr 
601 CCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCAC 
GGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTG 

GlnArgArgGlyArgThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGly 
661 TCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAG ATTTGTGGC ACCGGG 
AGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCC 

GluArgProSerGlyMetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCys 
721 GGAGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTG 
CCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGAC 

AlaTrpTyrGluI^uThrProMaGluThrThrValArgLeuArgAlaTyrMetAsnThr 
781 TGCTTGGTATGAGCTCACGCCCGCCG AGACTACAGTTAGGGTACGAGCGTACATGAACAC 
ACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTG 

ProGlyLeuProValCysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeu 
841 CCCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCT 
GGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGA 

ThrHisIleAspAlaHisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyr 
901 CACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTA 
GTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAAT 
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LeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAsp 
961 CCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGA 
GGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCT 

GlnMetTrpLysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeu 
1021 CCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCT 
GGTCTA(^CCTTCACAAAGTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGA 

TyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlle 
1081 ATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACAT 
TATCTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTA 

MetThrCysMetSerAlaAspLeuGluValValThrSexThrTrpValLeuValGlyGly 
1141 CATCACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGOGG 
GTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCC 

ValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArg 
1201 CGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAG 
GCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTC 

ValValLeuSerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPhe 
1261 GGTCGTCTTGTCCGGG AAGCCGGCAATCATACCTG ACAGGG AAGTCCTCTACCGAGAGTT 
CCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAA 

AspGluMetGluGluCysSerGlnHisLeuProTyrlleGIuGlnGlyMetMetLeuAla 
1321 CGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGC 
GCTACTCTACCTTCTGACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCG 

GluGlnPheLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVal 
1381 CGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGT 
GCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCA 

IleAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMet 
1441 TATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATAT 
ATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGGTTCGTATA 


1861 

FIG. 14 


TrpAsnPhelleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro 
1501 GTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCC 
CACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGG 

AlalleAlaSerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
1561 CGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCA 
GCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGT 

ThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAla 
1621 AACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGC 
TTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACG 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGly 
1681 CGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGG 
GCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCC 

LysValLeuIleAspIleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAla 
174 1 GAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGC 
CTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCG 

PheLysIl^etSerGlyGluValProSerThrGluAspLeuValAsnLeuLeuProAla 
1801 ATTCAAGATttTGAGCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCOGC 
TAAGTTCTAGTACTOGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCG 

IleLeuSerProGlyAlaLeuValValGlyValValCysAlaAlalleLeuArgArgHis 
CATCCTCTCGCCCGGMCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCA 
GTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGT 


-2 


ValGlyProGlyGluGlyAlaValGlnTrjiMetAsiiArgLeuIleAlaPheAlaSerArg 
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1921 CGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCGTTGGCCTCCCG 
GCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGC 

GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValThr 
1981 GGGGAACCATGTTTCCCCCACGCACTAGGTGCCGGAGAGCGATGCAGGTGCCCGCGTGAC 
CCCGTTGGTACAAAGGGGGTGCGTGATGCACGGGCTCTCGCTACGTCGACGGGCGCAGTG 

AlalleLeuSerSerl^uThrValThrGlnLeuLeiiArgArgLeuHisGlnTrpIleSer 
2041 TGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAG 
ACGGTATGAGTCGTCGGAGTGACATTGGGTGGAGGACTCCGCTGAGGTGGTCACCTATTC 

SerGluCysThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCys 
2101 CTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATGTGGGAC TGG ATATG 
GAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAG ACCCTGACCTATAC 

GluValLeuSerAspPheLysThrTrpLeuL^ 
2161 CGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGG 
GCTCCACAACTCGCTGAAATTCTGGACGGATTTTCGATTCGAGTAGGGTGTCGACGGACC 

IleProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMet 
2221 G ATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCAT 
CTAGGGG AAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTC ACCTGCCGTAGTA 

HisThrArgCysHisCysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArg : 
2281 GCACACTCGCTGCC ACTGTGG AGCTGAGATCACTGG ACATGTCAAAAACGGG ACG ATG AG 
CGTGTGAGCGACGGTGACACCTCGACTGTAGTGACCTGTACAGTTTTTGCCCTGCTACTC 

IleValGlyProArgThrCysArgAisnMetTrpSerGlyThrPheProIleAsnAlaTyr 
2341 GATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTA 
CTAGCAGCCAGGATCCTGGACGTCCTTGTACACGTCACCCTGGAAGGGGTAATTACGGAT 

ThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgVal 
2401 CACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGT 
GTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCA 

SerAlaGluGluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMet 
246 1 GTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCAGTAGGTGACGGGTAT 
CAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGC AGTGCCCATA 

ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
2521 GACTACTGAGAATGTCAAATGGCCGTGCCAGGTCCCATCGCCCGAATTTTTCAGAGAAT 
CTG ATG ACTGTTAG AGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 

FIG. 14-3 


J- 
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FIG. I 5 Translation of DKA 33c 

AlaValAspPh^IleProValGluAsnl^uGl^ 
1 GGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCAC 
CCGC»CCTGAAATAGGGACACCT^^ 

AspAsnSerSerProProValValProGlnSerPheGlnVamaHisI^uHisMaPro 
6 1 GGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCC 
CCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGT 

ThrGlySerGlyLysSarfhrLysValPrc^ 
121 CACAGGCAGCGGCAAAAGCACCAAGGTCCCGGC^^ 

GTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCA 


iAuValLeuAsnProSerValAlaAla^ 
181 GCTAGTACTOUUXCCTCTGTTGCTGCAA^ 

CGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCra 

Overlap with 40b • — — - — — 

HisGlyIleAspPrcAsnIleArgThx<UyValArgTte^ 
241 TCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCAT 
AGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTC 


ThrTyrSerThrTyzGlyLysPheLeuAlaAspGlyGlyCysSerGlyG 
301 CACGTACTCCACCTACGGCAAGTTCCTTGCCXaAGaGCGGGTGCTCGGGGGGCGCTT 
GTGCATGAGGTGGATGCGGTTCAAGGAACGGCTGCCGCCCACGAGCCCC^ 


IlellelleCysAspGluCysHisSerThrAspAlaThrSer IleLeuGly IleGlythr 
361 CATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTra 
GTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGA^ 


Vall^uAspGlnAlaGluThrMaGlyAlaArgl^uValValLeuAlaThrAl 
421 TGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGC 

ACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGG 


ProGlySerValThrValProHisProAsnlleGl^ 
481 TCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACGGG 
AGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCC 


GluIleProPhCTyrGlyLysMallePrdlAuGluVallleLysGlyGlyArgHisLeu 

541 AGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAM 

TCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTCTAC^ 


IlePheCysHisSerLysLysLysC^sAspGluI^^ ■ 
601 CATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGG 
GTAGAAGACAGXftAG lTl CTTCTTCACGCTGCTTGAGCGGCG 


IleAsriAlaValAlaTyrTyrArgGlyLeuAspValSerVallleProThrSerGlyAsp 

661 CATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACGAGCGGCGA 
GTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCT 

ValValValValAlaThrAspAlal^uMetThrGlyTyrThrGlyAspPheAspSerVal 
721 TGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGT 
ACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCA 

IleAspCysAsnThrCys 

781 GATAGACTGCAATACGTGTG 
CTATCTG ACGTT ATGCACAC 
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FIG. 1 6 Translation of DNA 8h 

ProCysThrCysGlySerSerAspLeuTyr:^ 
1 CTCCClXKJWnTCCGGCTCCTCGGACC^ 

GAGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGT^^ 

ValArgArgArgGlyAspSerArgGlySerI^ul>uSerProArgProIl^ 
61 CCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCC^ 
GGCACGCGGCCGCCCO^CTATC^^ 

LysGlySerSerGlyGlyPrtLeulAuCys 
121 TGAAAGGCTCCTCGGGGGGTCCGCTGTT^^ 

ACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTA 

Overlap with 

AlaMaValCysThrArgGlyValAl^ 
181 GGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGA 
CCCGGCGCC^CACGTGGGCACCTCA^ 

33c 

GluThrThrMetArgSerProValPheThrAspAsnSer 
241 TAGAGACAACCATGAGGTCCCCGGTGTTCACGGAIAACTCCTC 
ATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAG 


FIG. 17 Translation of DNA 7e 

GlyTrpArgLeuLeuAlaProIleTh^ 
1 GGGGTGGAGGTTGCTGGCGCCCAXCACGGCGTAOGCCCAGCAGA^ 
CCCCACCTCCAACGACCGCGGG1MTGCCGCATGCGGGTCG 

CysIlelleThrSerl^uThrGlyArgAspLysAsnGlnValGluGlyGluValGlnlle 
61 GTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGAT 
(^CGTATTAGTGGTCGGATTGACCGGCCCTGTTTTT^ 

ValSexThrAlaAlaGlnThrPheLeuAlaT^^ 
121 TGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATC^ 

ACACAGTTGACGACGGGTTTGGAAGGACCGTTGCAGGTAGTTAC^ 

TyrHisGlyAlaGlyThrArgThrlleAla 
181 CTACCACGGGGCCX*GAACGAGGACCATCGCGTCACCCAAGGGTCC^^ 
GATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCC^ 


ThrAsnValAspGlnAspI^uValGlyTrpProAlaPr^ 
241 TACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCGGCAA^ 

ATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAAC 

Overlap with 8h 

ProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHis 
301 ACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACG 
TGGGACGTGAACGCCGAGGAGCCTGGAAABGGACCAGTGCTOCGTGC 
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FIG. I 8 Translation of DMA 14c 


AsnMetTrpSerGlyThrPheProIleAsnAlaTy^ 
1 G* AGVTGT<MAGTGGGACCTTCCCCATT^TGCCTAalCCACGGGCCCCTGT^rcCrcT 

CmTACACCTCACCCTGGAAGGGGTAATTACGGA^ 


-Overlap with 25c- 


PrcsAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyrValGluIle 

tcctgcgccgaactacacgttcgcgctatggagggtgtctgcagaggaatacgtggagat 
aggacgcggcttgatgtgcaagcgcgatacctcccacagacgtctccttaxgcacctcxa 


ArgGlnValGlyAspPheHisTyrValThrGlyMetTh^^ 
121 AAt^^^TC^MGGACTTCCACTAOSTGACGGGTATGACTACTGACAATCTI AAAl GCCC 
TTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAATTTACG 


CysGlnValProSerProGluPhePheThrGlulAuAs^ 
181 GTGCCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCXA^TAGGTX 
CACGGTCCAGGGXAGCGGGCTTAAAAAGTGTCTTAAGCTGCCCCACGCGGATGTATCCAA 

AlaProProCysLysProI*ul«uArgGl^^ 
241 TGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCAOTA 
ACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCAIAGTAAGTCTCATCCTGAGGTGCT 

TyrProValGlySexGlnLeuProCysGluPrttGluPrc^ 
301 ATACCCGGTAGGGTCGCAATTACCTXGCGAGCCCGAACCGGACGTGGCCG3GTTGACGTC 
TATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGa^CCGGCACAACTGCAG 

MetLeuBirAspProSerHisIleThrAlaGluAlaAlaGlyArgArgLexiAlaArgGly 
361 CATGCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGG 
GTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCC 

SerProProSerValAlaSerSerSerAlaSerGlnLeuScrAlaProSerLeuLysAla 
421 ATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGC 
TAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCG 

ThrCysThrAlaAsnHisAspSerProAsp 
481 AACTTGCACCGCTAACCATGACTCCCCTGAT 
TTGAACGTGGCGATTGGTACTGAGGGGACTA 
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FIG, 1 9 Translation of -DNA 8f 

— — ; -—Overlap with 14c 1 

SerSerSerAlaSerGlnl^uSerA^ 
1 AGCTCCTCGGCTAGCGAGCTATCOGCTCCATCTCTCAAGGCAA 
TCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTC 


AspSerProAspAlaGluI^uIleGluAlaAsnLcuI^uTrpArgGlnGlxiMetGlyGly 
61 GACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCT^ 
CTGAGGGGACTACGACTCGAGTATCTCCGGTTGGA^ 

AsnlleThrAr^alGluSerGiuAsnLysVal^^ 
121 AACATCACCAGGGTTGAGTCA^ 

TTGTMTGGTCCCAACTCAGT C r irTO 

ValAlaGluGluAspGluArgGluIleSerVa^ 
181 GTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATCCTGC^ 
CACCGCCTCCTCCTGCTCGCCCTCTAG 

ArgPheAlaGlnAlaLeuProValTrpAlaArgProAspTyx^ 
241 AGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGAC^ 
TCTAAGCGGGTCCGGGACGGGCAAACC^ 

ThrTrpLysLysProAspTyrGlUProProValVal^ 
301 ACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCA 
TGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGAC^ 

LysSerProProvalPro 
361 AAGTCCCCTCCTGTGCCG 
TTCAGGGGAGGACACGGC 


FIG. 20 Translation of DNA 33f 


ValTrpAlaArgProAspTyrAsnProProlAuValGluThrTrpLysLysPrx^pT^ 
1 CGTTTGGGCGCGGCCGGACTATAAC^ 

GCAAACCCGCGCCGGCCTGAIATTGGGGGGCGATCACCTCTGCACCTTTT^ 

— — ~ — Overlap vith 8f — — — • . \' - ' . 

GluProProValValHisGlyCysPrc^iiProProPrdLysS^ProProValPro 
6 1 CGAACCACCTGTGGTCCATGGCTGCCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCC 
GCTTGGTCGACACCAGGTACCGAC^^ 

Pi^ArgLysLysArgThrValVallAuThrGluSerThrl^ 
121 GCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGGCC^ 
CGGAGCCTTCTTCGCCTGCCACOVG^ 

LeuAlaThrArgSerPheGlySerSerSerThrSert^ 
181 GCTCGCCACCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACA^ 

CGAGCGGTGGTCTTCGAAACCGTCGAGGAGTTGAAGGCOGTAATGCCOGCTGTTATGCTC 

ThrSerSerGluProAlaProSerGlyCysProProAspSerAapAlaGluSerPhe 
241 AA^TCCTClXaUK:CCGCCCCTTCTGGC^ 

TTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTC 
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EP 0 318 216 B1 
Translation of DNA 33g 


AlaSerArgSerPheGlySerSerSerThrSerGlylle^^ 
1 GCCTCCAGAAGCTTTGGCA6CTCCTCAACTTC 
CGGAGGTCTTCGAAACCGTCGAGGM 

Overlap with 33f • 

SerSerGluProAlaProSezGlTCysProProAspSerAspAlaGluSerTTrSerSer 
61 TCCTCTXSAGCCCGCCC^^ 

AGGAGACTOGGGCGGGGAAGACCGACGGGGGGGC^^ 

MetProProJ>uGluGlyGluProGlyAspPreAspLeuSe^ 
121 ATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTZAGOGAOGG 
TACGGGGGGGACCTCCCCCTraaa^ 

ValSerSerGluAlaAsnAlaGluAspValValCysCysSex^ 
181 GTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCra 
CAGTCATCACTCCGGTTGCGCCTCCTAC^ 

GlyAlaLeuValThrProCysAlaAlaGluGluGlnLys^ 
241 GGCGCACTGGTCACCCCGTGOGCOGOGGAAGAACAGAAACTGCCC^ 
CCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGAOGGGT^ 

AsnSerLeuLetiArgHisHisAsnLeuValTyrSerThrThrSerArgSer 
301 AACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTC 
TTGAGCAACGATGCAGTGGTGTTAAACOVCATAAGGTGGTGGAGTGCGTCAC 


FIG. 22 Translation of DNA 7f 

GlyThrTyrValTyrAsnHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuArg 
1 GGCACCTATGTTTATAACCATCTCACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGA 
CCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCT 

AspLeuAlaValAlaValGluProValValPheSerGlnMetGluThrLysLeuIleThr 
61 GATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACGAAGCTCATGACG 
CTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGC 

TrpGlyAlaAspThrAlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArg 
121 TGGGGGGCAGATACCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCCGC 
ACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGGCG 


ArgGlyArgGluIleLeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeu 
181 AGGGGCCGGGAGATACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGTTGGAGGTTG 
TCCCCGGCCCTCTATGACGAGCCGGGTCGGCTACCTTACCAGAGGTTCCCAAGCTCCAAC 


LeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysIlelleThr 
241 CTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCACC 
GACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCGCACGTATTAGTGG 

— . Overlap with 7e ■ — 

SerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGlnlleValSerThrAla 
301 AGCCTAACTGGCCGGG ACAAAAACCAAGTGGAGGGTG AGGTCCAG ATTGTGTCAACTGCT 
TCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGACGA 


AlaGlnThrPheLeuAlaThrCysIleAsnGlyValCysTrp 
361 GCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGG 
CGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACC 
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FIG, 23/ Translation of DNA lib 

GlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyr 
1 GGCGGTGTTGTTCTCGTGGGGTTGATGGCGCTGACTCTGTCACCATATTACAAGCGCTAT 
CCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATA 

IleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHis 
61 ATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCAC 
TAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTG 

ValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLeuLeuMetCys 
121 GTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGT 
CACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACA 

AlaValHisProThrLeuValPheAspIleThrLysLeuLeuLeuAlaValPheGlyPro 
181 GCTGTACACCCGACTCTGGTATTTGACATCACGAAATTGCTGCTGGCCGTCTTCGGACCC 
CGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGG 

I^uTrpIleLeuGlhAlaSerLeuI^uLysValProTyrPheValArgValGlnGlyLeu 
241 CTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTT 
GAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAA 

LeuArgPheCysAlaLeuAlaArgLysMe^^ 
301 CTCCGGTTCTGCGCGTTAGCGCGGAAGATG ATGGGAGGCCATTACGTGGAAATGGTCATC : 
GAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAG 


IleLysLeuGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThrProLeuArgAsp 
361 ATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACTCCTCTTCGGGAC 
TAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTG 

— ■ Overlap with It — — — — 

TrpAlaHisAsnGlyLeuArgAspLeuAlaValAlaValGluProValValPheSerGln 
421 TGGGGGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAA 
AGCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATGTGGGTCAGGAGAAG AGGGTT 


MetGluThrLysLeuIleThrTrpGly 
481 ATGGAGACCAAGCTCATCACGTGGGGGGC 
TACCTCTGGTTCGAGTAGTGCACCCCCCG 
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FIG. 24 Translation of DNA 14i 

GluTyrValVall^uIieuPhel^uI^uLeuAlaAspAlaArgValCysSerCysLeuTrp 
1 GGG AGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTGTGCTCGTGGTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetttetLeuLeuIleSerGlnAlaGluAlaA^ 
6 1 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACGTCGTAATACTTAATG 
CCTACTACGATGAGTATAGGGTTCGCCTCCGQCGAAACCTCTTGGAGCATTATGAATTAC 

AlaSerUuAlaGlyThrHisGlyL^^ 
121 CAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTG^T 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTA 

Tyrl^uLysGlyLysTrpValProGl^^ 

181 GGTATTTGAAGGGTAAGTGGG^ 

CCATAAACTTCCCATTCACCCACGC^CCTCGCCAGATGTGGAAGATGCCCTACACCG^ 

I^uLeuLeuI^uI^uMaLeuProGlnArgAlaTyrAlaLeuAsp 
241 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGCCG 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 


-Overlap with lib 


301 


SerCy sGlyGly ValValLeuValGlylieuMetAlaLeuThrLeuSer ProTyrTyrLy s 
CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGT 


ArgTyrlleSerTrpCysLeuTrpTrpLeuGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGAA 
TGGCGATATAGTCGACCACGAACACCACCGAAGTCTT 

FIG. 25 Translation of DNA 39c 


ProAlaProSer<31yCysProProAspSerAspAlaGluSerTyrS«S^^ProPro 

1 CC^CCCTTCTGGCTGCCCCCCCC^^ 

GGTCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGG 


61 


I^uGluGlyGluProGlyAspProAspLeuSerAspGlySe^ 

f^AG&G^AGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACAGTCAGTAGT 
GA^TCCCCCTCGGACCCCTAGGCCTAGAA 


- •- Overlap with 33g — — — ^ 


RluAlaAsiiAlaGluAspValValCysC^sSerMetSerTyrSerTrpThirGlyAl 
19T rAG^GAAGGCGGAGGATCTCGTGTGCTGCTCAATGTCCTACTGOT 


vai ThTProTvsAlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeu 
CAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGACTCGTTGAGCAAC 


iAuAraHisHisAsiil^uValTyrSerThrThrSerArgSerAlaCysGlnArgGlnXys 
241 OTAMTCACCACAATTTGGTGTATTCCACCIACCTCACGCAGTGCTTGCCAAAGGCAGAAG 

GATCCAG^GG^^ 


LvsValThrPheAspArgi«uGlnValI«uAspSerHisTyrGlnAspValI^ysGlu 

AAAGTCACATOTGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTAC^ 

T^CAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGA^ 


ValLysAlaAlaAlaSerLysValLysAlaAsnPhe 
361 GTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTC 
CAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAAG 
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FIG. 26~l COMBINED ORF OF DNAs 
14i/llb/7f/7e/8h/33c/40b/37b/35/36/81/32/33b/25c/14c/8f/33f/33g/39c 


GluTyrValVall^uLeuPheLeuteuLeuAlaAspAlaArgValCysSerCysLeuTrp 
1 GGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetMetLeuLeuIleSerGlnAlaGluAlaA^ 
6 1 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTCGAGAACCTCGTAATACTTAATG 
CCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTAC 

MaSerLeuAlaGlyThrHisGlyLeuValSerPheLeuValPhePheCysPheAlaTrp 
121 CAGGMCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCAT 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTA 

TyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProLeu 
181 GGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTGGCCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAG 

LeuLeuLeuI^uLeuMaLeuPrc^ 
241 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGC03 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 

SerCysGlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLys 
301 CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATG 

ArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGA^ 

TCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAGACTCGTCTCACCTTCGCG 

LeuHisValTrpIleProProI^uAsnValArgGlyGlyArgAspAlaVallleLeuLeu 
4 21 AACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTAC 
TTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATG 

MetCysAlaValHisProThrl^uValPheAspIleThrLysLeuLeuLeiiAlaValPhe 
481 TOVTGTGTGCTGTACACCCGACTCTGG^ 

AGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAA 

GlyProLeuTrpIlel^uGlnAlaSerLeuIjeuLysVa 
541 TCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCC 
AG CC TGGGG AAACCTAAGAAG T TCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGG 

GlyLeuIeuArgPheCysMal^uAlaArgLysMetlleGlyGlyHisTyrValGlnMet 
601 AAGGCC TTCTCCGG TTCTGCG CGTTAGCGCGGAAGATG ATCGGAGGCCATTACGTGCAAA 
TTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTT 

ValllelleLysLeuGlyMaLeuThrGlyThrTyrVaaTyrAsnHisLeuThrPrdLeu 
661 TGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACTCCTC 
ACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAG 

ArgAspTrpAlaHisAsnGlyLeuArgAspLeuAlaValAlaValGluProValValPhe 
721 TTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCT 
AAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGA 

SerGlnMetGluThrLysLeuIleThrTrpGlyAlaAspThrAlaAlaCysGlyAspIle 
781 TCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCGTGCGGTGACA 
AGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCCACTGT 

IleAsnGlyLeuProValSerAlaArgArgGlyArgGluIleLeuLeuGlyPixAlaAsp 
84 1 TCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCCAGCCG 
AGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGTCGGC 

GlyMetValSerLysGlyTrpArgLeuLeuAlaProIleThrAlaTyrAlaGlnGlnThr 


136 


EP0 318 216B1 


901 ATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGGGCQCATCACGGK 



961 

G' 

GlyGluValGlnlleValSerThrAlaAlaGlnThrPh^uAlaTl^sIl^nGly 
1021 AGGGTCAGGTa^TTCTCTCAACTGCTTOCOVAACX: 
TCCCACTOCAGGTCTAACACM 

ValCvsTrpThrValTyrHisGlyAlaGlyThrArgThrlleAlaSerP^y^lyPro 
1081 GGGTC^^^TC»O»CGGGGCCGGAA0GA 

1 ^cacaoa^ 

ii4i c^ffisassassasssa^^ 


1201 



Jg^TACAGTA^ 



ValGlyllePheArgAlaAlaValCysThrArgGlyVa^ 
1381 CCGTGGG^TATTTAGGGCCGCGGTGTGCACCMTGffl^TGGCTAAG^ 

GGCACCCGTATAAATCCOGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTGAAAT 

ProValGluAsnl*uGluThrThi^^ 
1441 TCCCTGTGGAGAACCTAGAGACAACCATGAGGTOCCOGGTGTTCACGGATAACTC 

1441 mIIaEacctcttgga^ 

ProValvalProGlnSerPheGlnVammslAuHisAlaP^ 



SerThrLysValProAlaAlaTyrAlaJUaG^ 
1561 AAAG^CCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTA^ 

???SGTGGTTCeW^C^^ 

SeiValAlaiaaThrI*uGlyPheGlyAlaTy^ 

16 2 V ££a1a^^ 

GlvLvsPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspIlellell^sJ^p 
1741 ACGGCMGTtS^^^OTGGTGCTCGGGG^ 
iSGTTCAAGGAA^ 

1801 i^ssssss^^ 

GluT^aGlyU^^ 

x:tctgacc 

FIG. 26-2 
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ValProHisProAsnlleGluGluValAlaLeiiSerThrTtu^lyGluIleProPheTyr 
1921 CTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCAC 

GACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTA^GAAAAA 

GlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeuIlePheCysHisSer 
1981 ACX3GCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTGTCATT 
TGCCGTTCCGATAGGGGGAGCTTCAOTAGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAA 

LysLysLysCysAspGluLeuMaAlaLysI^uValAlaLeuGlylleAsnAlaValAla 

2041 caaagaagaagtgcgaogaactcgccgcaaa£ctggtcgcattgk 
gtttcttcttcacgctgcttoagoggogtttggaccagc^ 

TyrTyrArgGlyLeuAspVaWerValll^^ 
2101 CCTACTACCGCGGTCTTGACGTGTCOGTCATCCCGACCAGCGGCGATGTTGTOGTOGTGG 
GGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTOGCCGCTACAACAG 

ThrAspAlaLeiiMetThrGlyTyrThrGlyAspPheAspSer 
2161 CAAC0GATGCCCTCATGACCX3GCTATACCGGCGACTTCGACTCOT 

GTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTA^ 

CysValThrGlnThi^alAspPheScrlieuAspProThrPheThrlleG 
2221 CGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGACAATCA 
GCACACAG1GGGTCTGTCAGCTAAAGTOGGAACTGGGATGGAAGTTC 

I^uProGlnAspAlaValSerArgThrGlnAr^ 
2281 CGCTCCCCCAGGATGCTGTCTCCOGCACTCAACX5TCGGGGCAGGACTGGCAGGGGGAAGC 
GCGAGGGGGTCCTACGACAGAGGGOGTGAGTTGCAGCCCOGTCCTGACCSTCCCCCTTC^ 

GlylleTyrArgPheValAlaProGlyGluArgProSerGlyMctPheAspSerSerVal 
2341 CAGGCATCTACAGATTTGTGGCAGOGGGGGAGOGCCCCTCCGGCATGTTCGACTCGTCC^ 
GTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGC 

LeuCysGluCysTyrAspAlaGlyC^sAlaTrpTyrGluLeuThrPr^ 
2401 TCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAGACTA 
AGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTCTGAT 

ValArgLeuArgAlaTyrMetAsnThrProGlyLeuProValCysGlnAspHisIieuGlu 
2461 CAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCATCTTG 
GTCAATCCGATGCTOGCATCTACTTGTGGGGCCCCGAAGGGCAC 

PheTrpGluGlyValPheThrGlyLeuThrHis^^ 
2521 AATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATCCCAGA 
TTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGGGTCT 

LysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyxGli^aThrValCy 
2581 CAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCA^ 
GTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCA^ 

AlaGlnAlaProProProSerTrpAspGln^ 
2641 GGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTCAAGC 
CCCGAGTTCGGGGAGGGGGTAGCACGCTGGTCTACACCTTGACAAACTAAGCGGAGTTCG 

ThrLeuHisGlyProThrProLeuI^^ 
2701 CCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATCAi^ATCA 
GGTGGGAGGTACCCGGTTGTGGGGAOGATATGTC^GACCOGOGACAAGTCOT 

LeuThrHisProValThrLysTyrlleMetThrCysMetSerAlaAspLeuGluValVal 

2761 CCCTGACGCACCCAGTCACCAAATACATCATGACA^ 

GGGACTGCGTCGGTCAGTGGTTTATGTAGTACTGTACGTACAGCOGGCTGGACCTCCAGC 

ThrSerThrTrpVall^uValGlyGlyVall^uAlaMaLeuMaMaTyrCysI^e^ 

2821 TCACGAGCACCIX^GTCCTCGT^^ 

AGTCCTCGTGGACCCACGAGCAACCGCCGCAGGACCGAOGAAACOT 

ThrGlyCysValVallleValGlyArgValValLeuSerGlyLysProAlallellePro 

FIG. 26-3 
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2881 CAACAGGCTGCGTGGTCATAG^ 

GTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATG 

AspArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisI^uPro 
2941 CTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTCCTCTCAGCACTTAC 
GACTGTCCCTTCAGGAGAT<XpCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATC 

TyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlaLysAlal^uGl^ul^u 
3001 CGTACATCGAGCAAGGGAT<^TGCTCGCCGAGCAGTTCAAG 

GCATGTAGCTCGTTCCCTACTACGAGCXX^TCGTCAAGTTCGTC 

GlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLys 
3061 TGCAGA(XGCGTCCCGTOVGGCAGAGGTTATCG^ 
ACGTCTGGCGCAGGGCAGTCCGTCTCCAATA^ 

I^uGluThrPheTrpAlaLysHisM^ 
3121 AACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGG 
TTGAGCTCTGGAAGACCXGCTTCGTATACACCTTGAAGTAGTCACCCTATC 

GlvI^uSerThrLeuProGlyAanPro^ 
3181 CX5GGCTTGTCAACGCTGCCIGGTAACCCCGC^ 

GCCCGAACAGTTGCGAOGGACXATTGGGGCGGTAAOGAAGTAACTACOGAAAATGTCGAC 

ValThrSerProLeuThrThrSerGlnThr^ 
3241 CTGTCACCAGCCC^CTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGGTCTC 
GACAGTGGTCGGGTCATTGGTGATCGGTTTGGGAGGAGAAGTTGTATA 

AlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeuAlaGly 

3301 TOK^CCCAGCTCX^OX:CCCCGGTGC0GCTACTGC 

ACOGACGGGTOGAGCGGCGGGGGCCACGGCGATGAOGGAAACAGCCGC^^ 

AlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGlyTyrGly 
3361 GCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGGTATC 
CGCGGCGGTAGCCG TCACAACCTGACCCCTTCCAGGAG TATCTGTAGGAACGTCCCATAC 

MaGlyVamaGlyAlaLeuValAlaPheLysIleMetSerGlyGluValProSerThr 
3421 GCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCCTCCA 
CGCGCCCGCA(X3GCCCTCGAGAACACCGTAAGTTCTAGTACT0GCCACTCCAGGGGA 

GluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeuValValGlyVal 
3481 CGGAGGACCTGGTCAATCTACTGCCCGCCAlCCTCTCGCCCGGAGCC^ 
GCCTCCTCGACCAGTTAGATGACGGGOGGTAGGAGAGCG^^ 

ValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAlaValGlnTrpMet 
3541 TGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGC^ 
ACOVGACACGTOTTTATGAC^^ 

AsnArgLeuIleAlaPheAlaSer^ 
3601 TGAACCGGCTGATAGCCTTOGCCTCCCGGGGGAACCATGTTTCC^ 

ACTTGGCCGACTATCGGAAGOGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCACG 

GluSerAspAlaAlaAlaArgValThrAlall^uSerSerl^uThrValThrGl^u 
3661 CGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCC 

GCCTCTCGCTAOGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACAT^^ 

I^uArgArgl^uHisGlnTrpIleSerSe^ 
TCCTGAGGCGACTCCACCAGT^^ 

AGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAGGA 
L^uArgAspIleTrpAspTrpIleCysGluVal^ 


3721 


3781 


GGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGAC 
CCGATTCCCTGTAGACCCTGACCTATACGCTCCACAAC1 

AlaLysI^uMetProGlnLeuProG^ 
AAC^TAAGCTCATGCCACAGCTGCCTGGGA 


3841 

FIG. 26-4 
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GlyvalTrpArgValAspGlylleMetHisThrArgCysHisCyeGlyAlaGluIlcThr 
3901 AGGGGGTCTGGCGAGTGGACGGCATCATGCACACTOGCTGCCACTGTGGAGCTGA^ 
TCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTO 

GlyHisValLysAsnGlyThrMetArglleValGlyProArgThrCysArgAanMet^ 
3961 CTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAACATO 
GACCTGTACAGTTTTTGCCCTGCTAC^ 

SezGlyThrPheProIleAsnAlaTyrThrTh^ 
4021 GGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCCTGOGC 
CCTCACCCTGGAAGGGGTAATTAGGGATGTGGTGCCCGGGGACATGGGGGGAAGG 

AsnTyrThrPheAlal^uTrpArgValSerAlaGluGlu 
4081 CGAACTACACGTTCGOGCTATGGA^ 

GCTTGATGTGCAAGCGOGATACCTCCCACAGAC^ 

GlyAspPheHisTyrValThrGlyte^ 
4141 TGGGGGACTTCCACTACGTGMGGGTAT^ 

ACCCCCTOAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGOT 

ProSerProGluPhePheThrGluLeuAapGl^^ 
4201 TCXICATCGCCC^AAaOTTTCACAGAATTGGACGGGG 

AGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACGGGGGG 

CysLysProLeuLeuArgGluGluValSerPhe^ 
4261 CCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTO^ 

GGACGTTOGGGAACGACGCCXTCCTCCATAGTAAGTC^ 

GlySerGlnl^uProCysGluProGlUPr^ 
4321 TAGGGTCGCAATTACCTTGCGAGCC^^ 

ATCCCAGCGTTAATGGAACGCTOGGGCTTGGCClXrc 

AspProSerHisIleOtorAlaGluAlaAla^ 
4381 CTGATCCCTCCCMATAACAGCAGAGGCGGCCGGGCGAAGGra 


SerValAlaSerSerSerAlaSa^lnLeuSerAlaProSerLeuLyaAlaThrCya^ 
4441 CCTCTGTGGCCAGC TCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAACTTGCA 
GGAGACACCGGTCGAGGAGCOGATCGGTOGATAGGCGAGGTAGAGAGro 

MaAsnlU-sAspSerPraAspAlaGluLeuIleGluW 
4501 COTCTAACCATGACTCCCCTGATGCTGAGC^ 

GGCGATTGGTACTGAGGGGACTAOGACTCGAGTATCTCCGGTTGGAGGATACCTCCGTOC 

MetGlyGlyAsnlleThrAr^alGluSerGluAsnLysValVallleLeuAspSerPhe 
4561 AGATGGGOGGCAACATCACCAGGGT1GAGTCAGAAAACAAAGTGGTGATTCTGGACT 
TCTACCC^CGTTGTAGTCGTCCCAACTCAGTCTTTO 

AspProLeuValAlaGluGluAspGluArgGluIleSerValProAlaGluIleLeuArg 
4621 TCGATCCGCTTGTGGCGGAGGAGGAC^AGCG^ 
AGCTAGGOGAACACCGCCTCCTCC^ 

Ly sSer ArgArgPheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsnProPro 

4681 ggaagtctcggagatt0gcccaggccctgccxx5tttgg 
ccttcagagcctctaagcgggtc^^ 

LeuValGluThrTrpLysLysProAspTyrGluProProValValHisGlyCysPrdLeu 
4741 CGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACXIACCTGTGGTCCATGGCTGTOOTC 
GCGATC^CCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACOGAOGGW 

ProProPrGLysSerProProValProProProArgLysLyaArgThrValVall^uTte 
4801 TTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAA6AAGCGGA0GGTGGTCCTCA 
AAGGTGGAGGTTTCAGGGGAGGACA0GGAGGOGGAGCCTTCTT0GCCTGCCACCAGGAGT 

GluSerThrLeuSerThrAlaLeu^ 

FIG. 26-5 
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4861 CTGAATCAACCCTATCTACTGCCTTGGCCGA 

GACTTAGTTGG<^TAGATGACGGAACCGGCTCGAGCGG 

ThrSerGlylleThrGlyAflpAsnThrThrThrSerSerGiitf^ 
4921 CAACTTCCGGCATTAOGGGCGACAATACGAC^ 

GTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGG 

ProProAspSer AspAlaGluSerTyrSerSerMet ProPrbLeuGluGlyGluProGly 
4981 GCCCCCCCXyvCTCCGACX^TGAGTCCTATTCCTCCATC 

CGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCC^ 

AspProAspteuSerAspGlySerTrpSerThrValS^ 

5041 gggatccggatcttagcgacgggtcatggtcaa^ 
ccctaggcctagaatox:tgcccagt^^ 

ValValCysCtf sSerMetSerTyrSerT^^ 

5101 atgtcgtgtgctgctcaatgtcttactcttggacaggcgcact 
tacagcacacgacgagttacagaatgagaacctgt^ 

GluGluGlnLy8LeuProIleAanAlaLeuSerAsnSerLeuLeuArgHisHisAsnLeu 
5161 OGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCAC^ 
GCCTTCTTCTCTTTGAC^GGTAGOTACGTGAT^ 

ValTyrSerThrThrSaxArgS^ 
5221 TGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATTTC^ 

ACCACATAAGGTGGTGGAGTGCGTCAOGAACGGT^ 

LeuGlnValLeuAspSerHisTyrGlnAspValLeuLysGluValLysAlaAlaAlaSer 
5281 GACTGCAAGTTCTGGACAGCCATTACGAGGACGTACTCAAGGAGGTTAAAGCAGOGGCGT 
CTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATCAGTTC 

LysValLysAlaAsnLeu 
5341 CAAAAGTGAAGGCTAACTTG 
GTTTTCACTTCOGATTGAAC 
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FIG. 27 Translation of DNA 12f 

IlePheLysIleArgMetTyrValGlj^lyValGluHisArgLeuGluAlaAlaCysAsn 

1 CCATATTTAAAATCAGGMOT 

GGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCT^ 

TrpThrArgGlyGluArgCysAspI^uGluAspArgAspArgSertluLeuSerProLeu 
61 ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCOGAGCTCAGCCCX5T 
TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCC 

I^uLeuThrThrThrGlnTrpGlnVall^uProCysSerPheThrThrLeuProAlaLeu 
121 TACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCT 
ATGACGACTGGTGATGTGTCACCGTCCAGGAGGGC^CAAGGAA 

SerThrGlyLeuIleHisLeuHisGlnAsnlleValAspValGlnTyrLeuTyrGlyVal 
181 TGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACG1XKIAGTACTTGTACGGGG 
ACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAAC^CCTGCACGTCATGAACATGCCCC 


GlySerSerlleAlaSerTrpAlalleLysTrpGluTyrValValLeuLeuPheLeuLeu 
241 TGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTG 
ACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAG 


LeuAlaAspAlaArgValCysSerCysLeuTrpMetMetLeuLeuIleSerGlnAlaGlu 
301 TGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGG 
ACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCG 

Overlap with 14 i 

AlaAlaLeuGluAsnLeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeu 
361 AGGCGGCTTTGGAGAACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTC 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAG 


Val 

421 TTGTATC 
AACATAG 
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FIG. 28 Translation of DNA 35 f 

— - Overlap with 39c— —— — -~— 

LeuLysGluValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGlu 

1 TGCTOVAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCT 
ACGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTCC^ 

AlaCysSerLeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAsp 
61 AAGCTTGCAGCCTGACGCCCCCACACTCAGCGAAATCGAAGTTTGGTTATGGGGCAAAAG 
TTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAAT^ 

ValArgCysHisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAspLeuLeu 

121 ACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAAC 

TGC^GGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGA^CACACCTTTCTGGAAG 

GluAspAsnValThrProIleAspThrThrlleMetAlaLysAsnGluValPheCysVal 
181 TGGAAGACAATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGG 

ACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGC 

GlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyVal 
241 TTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTGGTCTCATCGTGTTCCCCGATCTGGGCG 
AAG TCGGACTCTTCCCCCCAGCATTCGGTCG AGCAGAG TAGCACAAGGGGCTAG ACCCGC 

ArgValCysGluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAlaValMet 
301 TGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGA 
ACGCGCACACGCTTTTCTACCGAAACATCCTGCACCAATGTTTCGAGGGGAACCGGCACT 

GlySerSerTYrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAla 
361 TGGGAAGCTCC TACGG ATTCCAATACTCACCAGG ACAG CGiGGTTGAATTCCTCGTGCAAG 
ACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTC 

TrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThr 
*21 CGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCA 
GCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACX3AAACTGAGGT 

ValThrGluSerAspIleArgThrGluGluAla 
481 CAGTCACTG AGAGCGACATCCG TACGGAGG AGGCA 
GTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGT 


143 


EP 0 318 216 B1 


FIG. 29 Translation of DNA 19g 


GluPheteuValGlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThr 
1 GAATTCCTCGTGCAAGCGTGGAAGTCGAAGAAAACCCCAATGGGGTTC^ 

CTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGG 

■ — Overlap with 3 5f — — — — — — -—— 

ArgCysPheAspSerThrValThrGluSerAspIleArgThrGluGluAlalleTyrGln 
6 1 CGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAA 
GCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTT 

CysCysAspLetiAspProGlnAlaArgValAlalleLysSerLeuThrGluArgLeuTyr 
121 TGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAAGTGCCTCACCGAGAGGCTTTAT 
ACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATA 

ValGlyGlyProLeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArgAla 
181 GTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCG 
CAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGC 

SerGlyValLeuThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAla 
241 AGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCACTTGCTACATCAAGGCCCGGGCA 
TCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGAAGGATGTAGTTCCGGGCCCGT 

AlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeuVal 
301 GCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTAGTC 
CGGACAGGTCGGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATCAG 

VallleCysGluSerAlaGlyValGlnGluAspAlaAla 
361 GTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCGAG 
CAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGCTC 
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FIG. 30 Translation of DNA 26g 


cS^SSaaSSa^cgg^tcgtcggacagctcggcgtcccgaggtc 


241 ssbskske^^ 

301 ssssss» v « 

ca^Sgcgg^ctgccgcgacctttctcccagatgatggagtgggcactg 

ThrProI^uAlaArgAlaAlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeu 

361 ACCCCCCTO^GAGA^ 

TGGGGGGAGCGCTCTOTACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGA 

GlyAsnllelleMetPheAlaProThrLeuTrpAla 
421 GGCAACATAATCATGTTTGCCCCCACACTGTGGGCG 
CCGTTGTATTAGTACAAACGGGGGTGTGACACCCGC 

FIG. 31 Translation of DNA 15e 


— — ■ — Overlap with 26g — ■ — ~ 

AlaTrpGluThrAlaAMHisThrPTOValAsnSerTrpI^uGlyAsnllelleMetPhe 

61 TGCGTCGGAG^ 

SACCCTCT^G^TGTCTGAGGTCAGTTAAGGACCGArcCGTTGTATTAGTACAA 


AlaProThrLeuTrpAlaArgMetlleLeuMetThrHisPhePheSerValI.euIleAla 

121 TGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCT 

ACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATATCG 

AwAspGlnLeuGluGlnAlalauAspC^^ 

181 CA^GA^AGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATAGA 

GTrcCT^TCGA^TTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTATCT 

ProLeuAspLeuProProIlelleGlnArgLeu 
241 ACCACTTGATCTACCTCCAATCATTCAAAGACTC 
TGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
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FIG. 32" I C04BINED ORF OF DNAs 12f through 15e 

IlePheLysIlei^MetTyrValGlyGlyValGluHisArgDBuGluAlaAl^sAsn 
1 CCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCA 

Stataaat^ 

TrDThrAraGlvGluAxattsAspI^uGlu^ 
61 AClGGACGCGGGGCGAACGTTGCGATCTGGAA^CAGGGACAGGTCCGAGCTCAGCCOT 

I/HU>wThrThrTlu*UnTrpGl^^ 
121 tactcctgaccactacacagtggcaggtcctcccgtgttccttcaca^cctaccag^t 

atgacgac^t<»tgtgtcaccgtccaggag^ 


181 TG 
ACA( 


SerthrGlyl^uIleHisLeuHisGlnAsnlleValAspValGlnTyr^Tj^yVal 
ITCCACCGGCCTCATCCACCTCCACCAGAACATTGTCGACGTGCAGTACTTGTA^ 
' \GGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCC 


GlySerSerlleAlaSerTrpAlalleLysTrpGluTyrValVal^^ 
241 TGGGGTC^GCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTC 

ACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAG 

I^uAlaAspAla^rgValCysSerCysLeuTrpMet^ 

tgcttgcaga<xcgcgcgtctgctcctgcttc^ 

^cgtctgcgcgcgom;acgaggacgaacacctactacgatgagtatagggttcgcc 


A 

301 


ACGAAi 


AlaAlaI«uGluAsnI^uValIleI«uAsnMaMaSerLeuMaGlyThrHi^yLeu 
361 AGGCGGCTOTCGAGAACCTTOTAATACTOAATCCAGCATCCCTGGCCGGGA ■ 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAG 


TCI 

601 


ValSerPheI«uValPhePheCy8PheAlaTrpTyrI«uLysGlyLysT^yalProGly 
421 TTGTATCCTTCCTCGTGTTCTTCTGCTTTGC^TGGTATTTGAAGGGTAAGTGGGTGCCCG 
AACATAGGAAGGAGCACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGG 

AlaValTvrThrPheTyrGlyMetTrpProLeuLeuI^iiI^uI^uLe u AlaLeuProGln 

481 GAGCGGTCTAC»CCTTCTACGGGATCTGGCCT^ 

CTCGCCAGATGTGGAAGATGCCCTACACCGGAGAGGAGGACGAGGACAACCXJCAACGGGG 

ArgAlaTyrAlaLeuAspThrGluValAlaMaSerCysGlyGlyValVall^alGly 
541 AGCGGGOSTACGCGC^ACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTG 
TCGCCCGCATGCGCGACCTGTGCCTC^ 

I^uMetAlaLeuThrl^uSerProTyrTyrLysArgTyrlleSerTrpCys^TrpTrp 
GGTTGA^CGCTCACTCTCTCACCATATTACAAGCGCTATATCAGCTGGTGCTTC^T 
CCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCA 

I^uGlhTyrPheLeuThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsh 
661 GGCTTCAGTATTI^TGACCAGAGTGGAAGGGCAACTGCACGTO 

CCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTGCACACCT 

ValArgGlyGlyArgAspAlaVallleLeuLeuMetCysAlaValHisPr^hrXeuVal 

721 ACGTCCGAGGGGGGCGCG acgc<x;tcatcttactcatgtgtc 

TGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACC 

PheAspIleThrLysI^uLeuLeuAlaValPheGlyProI^uTrpIleLeuGlnAljSer 
781 TATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCCCTri^ 

ATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGT 

LeuLeuLysValProTyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeuAla 
841 GTTTCCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGTCCG^AG 
OVAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGAOGCGOVATC 

ArgLysMetIleGlyGlyHisTyrValGlnMetValIleIl^ysI«uG^Ala^Thr 
901 CGCGGAAGATGATCGGAGGCCATTACGTGCAAATGGTCATCATTAMTTA^ 

GCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCOCCGCGAAT 
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GlyThrTyrValTyrAsnHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuArg 
961 CTGGCACCT ATGTTT ATAACCATCTCACTCCTCTTCGGGACTCGGCGCAC 

GACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCOGCGTGTT^ 

AspLeuAlaVamaValGluProValValPheSerGlnMetGluThrLysLeuIleThr 
1021 GAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATOTAGACCAAGCTCA 
CTCTAGACOSGCACCGACATCTCX^TCAGCAGAAGAGGGTm 

TrpGlyAlaAspThrAlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArg 
1081 OGTGGGGGGCAGATACCGCCGCGTGCGGTGACATC&TCAAOGGCTTGCCTC 
GCACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGra^ 

AraGlyArgGluIleLeuLeuGlyProAlaAspGlyMetyalSerLysGlyTr^ 
1141 GCAGGGGCCGGGAGATACTGCTCGGGCCAGCCGATC&AATGGTCTCCAAGGC^ 

cx;tccccggccctctatgacgagcccggtcggctaccttaccagaggttc 

I^uAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyl^uLeuGlyCysIlelleThr 

1201 TGCSOGCGOtXXSCii^^ 

ACGACCGCGGGTAGTGCCGC^TGCGGGTCGTCTGTTCCCCC^AGGATCCCACGTATTAGT 

Serl^uThrGlyAMAspLysAsnGlnValGluGlyGluValGlnlleValSerTh^a 

1261 ccagcctaactggccgggacaaaaaccaagtggagggtgaggtccaga™ 

GGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTC 

AlaGlnThrPheIx»uAlaThrCysIleAs^ 
1321 CTGCCCAAACCTTCCTGGCAAGGTGCATCAATGGTC 

GACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCC 

GlyThrArgThrlleAlaSerProLysGlyProVallleGlnMetTyrThrAsnValAsp 
1381 GCGG^CGAGGACCATCGCGTC^CCCAAGGGTCCTGTCATCCAGATGTATACCAATGTAG 
GGCCTTGCTCCTGGTAGCGCAG TGGGTTCCCAGGACAGTAGGTCTACAT ATGGTTACATC 

GlnAspLeuValGlyTrpProAlaProGlnGlySerArgSerlieuThrProCysThrCys 
1441 ACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTT 
TGGTTCTGGAAGACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTC 

GlySerSerAspLeuTyrLeuValThrArgHisAlaAspVallleProValArgArgArg 
1501 GCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACGCCGATGTCATTCCCGTGCGCCGGC 
CGCCGAGGAGCCTGGAAA1XMACCAGTGCTCCGTGC(^CTACAGTAAGGGCACGCGGCCG 

GlyAspSerArgGlySerLeuLeuSerProArgProIleSerTyrLeuLysGlySerSer 
1561 GGGGTGATAGCAGGGGCAGCCTGCTOTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCC^ 
CCCCACTATCGTCCCCGTCGGACGACAGCGGGGCCGGGTAAAGGATGAACTTTCCGAGGA 

GlyGlyProLeuLeuCysProAlaGlyHisAlaValGlyllePheArgAlaAlaValCys 
1621 CGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGTGGGCATATTTAGGGCCGCGGTGT 
GCCCCC»GGCGAC^«CGGGGCGCCCCGTGCGG(^CCa;TATAAATCC<^ 

ThrArgGlyValMaLysAlaValAspPKelleProValGliJAsnLeuGluThrThrMet 
1681 GCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCA 
CGTCGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTTOATCT^ 

ArgSerProValPheThrAspAsnSerSerProProValValProGlnSerPheGlnVal 
1741 IX^GGTCCCCGGTGTTCACGGATAACTCCTCTCCACCAGTAGTGCCGCAGAGCTTCCAGG 
ACTCCAGGGGCCACAAGTCCCTATTGAGGAGAGGTGGTCATCACXX3 

AlaHisLeuHisAlaProThrGlySerGlyLysSerThrLysValProAlaAlaTyrAla 
1801 TGGCTCACCTCCATGCTCCCACAGGCAGCGGCAAAAGCACCAAGGTCGCGGCTGCATATG 
ACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCOGACGTATAC 

AlaGlnGlyTyrLysValLeuValLeuAsnProSerValAlaAlaThrLeuGlyPheGly 
1861 CAGCTCAC^^TATAAC^TGCTAGTACTCAACCCCTCTGTTGCT^ 

GTCGAGTCCCGATATTCCACGATCATGAGTTGGGGAGACAACGACGT^ 

AlaTyrMetSerLysAlaHisGlylleAspProAsnlleArgThrGlyValArgThrlle 
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1921 GTGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCG 
CACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCC 

ThrThrGlySerProIleThrlVrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 
1981 TTACCACTGGCAGCCCCATCACGTACra 

AATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCOT 

SerGlyGlyAlaTyrAspIlellelleCysAspGluCysHisSerThrAspAlaThrSer 
204 1 GCTCGGGGGGCGCTTATGACATAATAATTTG1X5ACGAG 

CGAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGG7GCCTAGGGTGTA 

IlelieuGlylleGlyThrValLeuAspGlnMaGluThrMaGlyAlaArgLeuValVal 
2101 CCATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGAC 
GGTAGAACCCGTAGCC&TGACAGGAACTGGTTCGTCTCTGA 

LeuAlaThrMaThrProProGlySerValThrValProHisProAsnlleGluGluVal 
2161 TGCTCGCCACCXKX^CCCCTCCGGGCTCraTCAOX; 

ACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTCACACGGGGTAGGGT^ 

AlaLeuSerThrTh3cGlyGluIleProPheTyrGlyLysAlaIleProLeuGluValIle 
2221 TTGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAA 

AACG AG ACAGGTGG TGGCCTCTCTAGGG AAAAATGCCG TTCCG ATAGGGGGAGCTTCATT 

LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
2281 TCAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAA 

AGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGOGGC 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
2341 CAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTAGTACCGCGGTCTTGAOT 
GTTTCGACCAGCGTAACCGGTAGTTACGGCACCGGATGA 

IleProThrSerGlyAspValValValVamaThrAspAla^ 
2401 TCATCCCGACCAGCGGCGATGTTGTCGTCX51XX5CAACCGATGCCC 
AGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCro 

GlyAspPheAspSerVallleAspCysAsnThrCysValThrGlnThtValAspPheSer 
2461 CCGGCGACTTCGACTCGGTGATAGACTGCAATACX3TGTGTCACC 

GGCCGCTGAAGCTGAGCCACTATCTGACX3TTATGCACACAGTGGGTCTG 

LeuAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThr 
2521 GCCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTC 
CGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTTC 

GlnArgArgGlyArgThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGly 
2581 CTCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTC 
GAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCX3TA 

GluArgProSerGlyMetPheAspSerSeTValLeuCysGluCysTyrAspAlaGlyCys 
2641 GGG AGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTC 

CCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGA 

AlaTrpTyrGluLeuThrProAlaGluThrThrValArgl^uArgAlaTyrMetAsnThr 
2701 GTGCTTGGTATGAGCTCAOGCCCGCCGAGACTACAGOTAGGCT 
CACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGA 

ProGlyLeuProValCysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeu 
2761 CCCCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTA 

GGGGCCCCGAAGGGCAC^CGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGG 

ThrHisIleAspAlaHisPheLeuSerGlnThrLysGlnSexGlyGluAsnLeuProTyr 
2821 TCACTC^TATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGG^ 
AGTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTC 

LeuValAlaTyrGlnAlaThrValCysAlaArgMaGlnAlaProProProSerTrpAsp 
2881 ACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCC^ 
TGGACCATCGCATGGTTOGGTGGCACACGCGATC^ 
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GlnMetTrpLysCysI^uIleArgLeiiLysProThrLeuHisGly ProThrProI^uLeu 
2941 ACCAGATGlXK3AAGTGTTTGATTCGCCTCAAGCCCAa:C 
TGGTCTACACCTTCACAAACTAAGK^ 

IVrArgteuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlle 
3001 TATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGrc 
ATATGTCTCACCCGOGACAAGTCTTACTO 

MetThrCysMetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGly 
3061 TCATGACATGCATGT0GGCCGA0CTGGAGGTCX3TC 

AGTACTGTACGTACAGCCGGCTGGACCTCCAGC^T^ 

ValteuAlaAlaLeuAlaAlaTyrCyste^ 
3121 GCGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCG^ 
CGCAGGACCGAOGAAACCGGCGCATAAOGKSACAGTTGTCCGACGC^ 

ValValLeuSerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPhe 
3181 GGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAG 
CCCAGCAGAACAGGCCCTTOGGCCGTTAGTATGGACTGTCCCTTC 

AspGluMetGluGluCysSerGlnHisLeuPro^ 
3241 TCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCG 
AGCTACTCTACCTTCTCACGAGAGTOGTGAATGGCATGTAGCTCGTTCC 

GluGlnPhcLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVal 
3301 CCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGG 
GGCTCGTCAAGTTCGTCTTCCGGGAGCCXMAGGACGTCTGGCGCAGG 

IleAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMet 
3361 TTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATA 
AATAGCGGGGACGACAGGTCIX^TTGACCGTTTTTGAGCTCT^ 

TrpAsnPhelleSerGlylleGlnTyrl^uAlaGlyLeuSerThrl^uProGlyAsnPro 

3421 TGTGGAACTTCATCAGTGGGATAC^TACTTGGCGGGCT^^ 

ACACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGG 

AlalleAlaSerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
3481 CCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCC 
GGCGGTAACGAAGTAACTACCSAAAATCTCGACGACAGT^ 

ThrLeuLeuPheAsnllelieuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAla 
354 1 AAACCCTCCTCTTCAACATATTCGGGGGGTGGGTGGCTG^ 

TTTGGGAGGAGAAGTTGTATAACCCCCCCACCCACGGACGGGTCGAGCGGCGGGGGCCAC 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGly 

3601 CCGCTACTGCCTTTGOX^CGCTGGCTTAGCT^^ 

GGCGATGACGGAAACACCCGOTACCGAATCGACCGC^CGK3TAGCCGTCACAACCTGACC 

LysValLeuIleAspIleLeiiAlaGlyTyrGlyAlaGlyValAlaGlyAla3^euValAla 

3661 GGAAGGTCCTCATAGACATCCTTGCAGGGTATOT^ 

CCTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACC 

PheLysIleMetSerGlyGluValProSerThrGluAspLeuValAsnL^ 
3721 CATTC^GATCATCAGO^TGAGGTCCCCTCCAC 

GTAAGTTCTAGTACTCX^CACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGC 

IleLeuSerProGlyAlaLeuValValGlyValValCysAlaAlalleLeuArgArgHis 
3781 CCATCCTCTCGCCCGGAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGGGGCGGC 
GGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTGGTTATGACGCGGCCG 

ValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArg 
3841 ACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCC 
TGCAACCGGGCCCGCTCCCCOGTC^CGTCACCTACTTGGCCGACTATCGGAAGCGGAGGG 

GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValThr 
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3901 GGGGGAACCATGTTTCCCCCACGCACTACGTCCCG^ 
CCCCCTTX&TACAAAGGGGGTO 

MalleLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSer 
3961 CTGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAA 
GAOGGTATGAGraSTOSGAGTGAC^^ 

SerGluC^sThrThrProCysSerGlySerTrpLeviArgAspIleTrpAspTi^ 
4021 GCTCGG AG TGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGG ACTGGATAT 
CGAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATA 

GluValLeuSerAspPheLysThrTrpLeuLyaAlaLysLeuMetProGlal^uP 
4081 GCGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGC^ 
CGCTCCACAACTCGCTGAAATTCTGGACCGATTCT 

IleProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMet 
4141 GGATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTCGC^ 

CCTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTC 

HisThrArgCysHisCysGlyMaGluIleThrGlyHisValLysAsnGlyThrMetArg 
4201 TGCACACTCGCTGCCACTGTGGAGCTCAGATCACTGGACATGTC^ 

ACGTGTGAGCGACGGTGACACCIXXaACTCTAGTGACCTGTACAGTTTTTO 

IleValGlyProArgThrCysArgAsnMetTrpSerGlyT^ 
4261 GGATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCT 
CCTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGA 

ThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpAr^Val 
4321 ACACCACGGGCCCCTGTACCCCCCTTCCTGOGCCGAACTACACGTTCGCGCTATGGAGGG 
TGTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGC^ 

SerAlaGluGluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMet 
4381 TGTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTA 
AC^GACGTCTCCTTATACACCTCTATTCCGTCCACC^CCTGAAGGTGATGCACTGCCCAT 

ThrThrAspAsnl^uLysCysProCysGlnValProSerProGluPhePheThrGluLeu 
4441 TCACTACTCACAATCTCAAATGCCCGT^ 

ACTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 

AspGlyValArgl^uHisArgPheAlaProProCysLysProLeuLexiArgGluGluVal 
4501 TGGACGGGGTGCGCCTACATAGGTTTGOGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGG 
ACCTGCCCCACGCGGArcTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCC 

SerPheArgValGlyLeuHisGluTyrProValGlySerGlnLeuProCysGluProGlu 
4561 TATCATTCAGAGTAGGACTCCAOGAATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCG 
ATAGTAAGTCTCATCCTGAGGTGCS^^ 

ProAspValAlaValLeuThrSerMetlieuThrAspProSerHisIleThrAlaGluAla 
4621 AACCGGACGTGGCCGTGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGCAGAGG 
TTGGCCTGCACCGGCACAACTGCAGGTACX3AGTGACTAGGGAGGGTATATTGTCGTCTCC 

AlaGlyArgArgLeuAl£ArgGlySerProProSerValAlaSerSerSerAlaS€trGln 

4681 cggccgggcgaaggttggcgaggggatcacccccctctgtggccagctcctcggctagcc 
gcoggcccgcttccaacox;tcccctagtggggggagacaccggtc^ 

I^uSerAlaProSerLeuLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGlu 
4741 AGCTATCCGCTCCATCTCTCAAGGCAACTTGCACCGCTAACCATGACTCCCCTGATGCTG 
TCGATAGGCGAGGTAGAGAGTTCCGTTCAACGTGGCGATTGGTACTGAGGGGACTACGAC 

Leul leGluAlaAsnLeuLeuTrpArgGlnGluMetGlyGlyAsn I leThr ArgValGlu 
4801 AGCT^TAGAGGCCAACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCACCAGGGTTG 
TCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCOGTTGTAGTGGTCCCAAC 

SerGluAsnLysValVallleLeuAspSerPheAspProLeuValAlaGluGluAspGlu 
4861 AGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACG 
TCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGC 
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4921 AGCGGGAGATCTCCGTACCCGCAGAAATCCTGOGGAAGTCTCX3GAGATTCGCCCAGGCCC 
TCX3CCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTO 

, Art , ProV alTrpAlaArgProAspTyrAsnProProLeuValGluThrTrpLysLysPrQAsp 
4981 TGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCG 
ACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTO 

TyrGluProProValValHisGlyCysProLeuProProProLysSerProProValPro 
504 1 ACTACGAACCACCTGTGGTCCATGGCTGTCCQCTTCCACCTCCAAAGTCCCCrc 
TGATGCTTGGTGGACACCAGGTACCGACAGGCGAAGGTGGACX3TTO 

c , Prop roArgLysLysArgThrValVal^^ 

5101 CTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGG 
GAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACC 

GluI^uAlaThrArgSerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThr 
5161 CCGAGCTCGCCACCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATA 
GGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTAT 

c ^ ~ TteT ^S e rSerGluProAlaProSertlyCysProProAspSerAspAlaGluSerTyr 
5221 CGACAACATCCTCTGAGCCCGCCCCTTCTK 

GCTGTTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGA 

SerSerMetProProLeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrp 
5281 ATTCCTCCATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCAT 
TAAGGAGGTACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTA 

SerThrValSerSerGluAlaAsnAlaGluAspValValCysCy sSerMetSerTyrSer 
5341 GGTCAACGGTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATC 

CCAGTTGCCAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAA 

c . ~ ^PT^iyAlal^uValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAla 
5401 CTTGGACAGGCGCACTCGTCACCCCGTGCGCCXSCGGAAGAAC^^ 

GAACCTGTCCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTC 

~ ^^ er AsnSerteuLeuArgHisHisAsnLeuValTyrSerThrThrSerArgSerAla 
5461 CAC T AAGCAAC TCG TTGC TACG TCACCACAATTTGG TGT ATTCCACCAC C TCACGCAG TG 

GTGATTCGTTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTC 

ceo, cjs 11 *^^ 

5521 CTTGCCAAAGGCAGAAGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACC 
GAACGGTTTCCGTCTTCTTTCAGTGTAAACTCTCTG 

_ AspValLeuLysGluValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerVal 
5581 AGGACG TACTCAAGG AGGTTAAAGCAGCGGCGTCAAAAGTG AAGGCTAACTTGC TATCCG 
TCCTGCATGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTC 

GluGluAlaCysSerLeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAla 
5641 TAGAGGAAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTT^ 
ATCTCCTT(X3AACGTCGGACTGCGGGGGTGT^ 

LysAspValArgCysHisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAsp 
5/01 CAAAAGACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAG 
GTTTTCTGC^GGCAACGGTACGGTCTTTCCGGCATTGGG 

LeuLeuGluAspAsnValThrProIleAspThrThrlleMetAlaLysAsnGluValPhe 
5761 ACCTTCTGGAAGACAATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGGTTT 
TGGAAG ACCTTCTG TTACATTG TGGTTATCTGTGATGGTAGTACCG ATTGTTGCTC 

CysValGlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeu 
5821 TCTGCGTTCAGCCTGAGAAGGGGGG1XX3TAAGCCAQCTCGTCTCATCGTC 

AGACGCAAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAG 

GlyValArgValCysGluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAla 
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5881 TGGGCGTGCGCGTGTGCX3AAAAGATGGCTTTGTACGAC6TGGTTACAAA 
ACCCGCACGCGCACACGCtTTTCTACCGAAACATGCroCACCAATC 

ValWetGlySerSerTyrGlyPheGlnTyrSerProGlyGlnArgValGluPh 
5941 CCGTGATGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGOGGGTTGAATTCCTC^ 
GGCACTACCCTTCGAGGATGCCTAA<&TT^^ 

GlnAlaTrpLysSerLysLysThr ProMetGlyPheSerTyrAspThrArgCysPheAsp 
6001 TGCAAGCGTGGAAGTCCAAGAAAACCCC^TGGGG^ 
ACGTTCGCACCTTCAGGTTCTTTTCGGGTTACOOCAA^ 

SerThrValThrGluSerAspIleArgThrGluGluMalleTyrGlnCysC^sAspLeu 

6061 ACTCCACAGTC^CTGAGAGCGACATCCG^ 

TGAGGTGTCAGTGACTCTCXKITGTAGGCATGCCTCCTCCGra 

AspProGlnAlaArgVamalleLysSerLeuTh^ 
6121 TCGACCCCCAAGCCCGCGTGGCCATC^ 

AGCTGGGGGTTCGGGCGCACCGGTAGTT^ 

LeuThrAsnSerArgGlyGltiAsnCysGlyTyrArgArgC^sArgAlaSerGl^^ 
6181 CTCTTACC^UVTTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGGGTAC 
GAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGCTCGCCGCATG 

ThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAla 
6241 TGACAACTAGCTGTGGTAACACCCTCACTTGCTACATCAAGGCCGGGGCAGCCTGTCGAG 
ACTGTTGATCGACACCATTGTGGGAGTGAACGA 

AlaGlyLeuGlnAspC^sThrMetl^uValCysGlyAspAspIieuValVallleCysGlu 
6301 CCGCAGGGCTCCAGGACIXJCACCATGCTCGTGTGTGGCGACGACTTAGTC^ 

GGCGTCCCGAGG TCCTGACGTGGTACGAGC ACACACCGCTGCTGAATCAGCAATAGACAC 

SerAlaGlyValGlnGluAspAlaAlaSerLeuArgAlaPheThrGluAlaMetThrArg 
6361 AAAGCGCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCA 
TTTCGCGCCCCCAGGTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGT 

TyrSerAlaProPrcClyAspProProGlnProGluTyrAspl^uGluLeuIleThrSer 
6421 GGTACTCCGCCCCCCClX^^ACCCCCCACAACCAGAATACGACrTGGAG^ 

CC^TGAGGCGGGGGGGACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTA 

CysSerSerAstiValSerValAlaHisAspGlyAlaGlyLysArgValTyrTyrl^uThr 
6481 CATGCTCCTCCAACGTGTCAGTCGCCCACGACGGCG 

GTACGAGGAGGTTGCACAGTCAGCGGGTGCTGCCGCGACCTTTCTCC 

ArgAspProThrThrProLeuAlaArgAlaAlaTrpGluThrAlaArgHisThrProVal 

6541 cccgtcaccctacaacccccctcgcx;agagctgogtgggagacagcaagaca(^ 
gggcactgggatgttggggggagcgctctcgaog»ccctctc 

AsnSerTrpLeuGlyAsnllelleMetPheAlaPrbThrLeuTrpAlaArgMetlleLeu 
6601 TCAATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATAC 
AGTTAAGGACCGATCCGTTGTATTAGTACAAACGGGGGTGTC 

MetThrHisPhePheSerValLeuIleAlaArgAsp^ 
6661 TGATGACCCATTTCTTTAGCGTCCTTATAGCCAGGGACCAGCTTGAACAGGCCGTOGATT 
ACTACTGGGTAAAG AAATGGCAGGAATATCGGTCCCTGGTCGAACTTGTCCGGG AGCTAA 

GluIleTyrGlyAlaCysTyrSerlleGluProI^uAspLeuProProIlelleGlnAfg 
6721 GCGAGATGTACGGGGCCTGCTACTCC^TAGAACCACTTGATCTACCTCCAATCATTCA^ 
CGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGTAAGTTT 


Leu 

6781 GACTC 
CTGAG 
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FIG. 33 LEGEND 





Sample 



Chimp 


date (days) 

ALT (alanine) 

tane 

Reference 

Infection 

( 0-inoculation 

aminotransferase 

Number 

Number 

Type 

day) 

level in sera) mt^/mi ) 

1 

1 

NANB 

0 

A 

: * 

2 

I 

NANB 

76 

71 

3 

I 

NANB 

ue 

19 

4 

I 

NANB 

: 154 

. N/A 

5 

2 

NANB 

.0 

■ 5 

6 

2 

NANB 

21 

52 

7 

2 

NANB 

73 

13 

8 

2 

NANB 

138 

N/A 

9 

3 

NANB 

0 

9 

10 

3 

NANB 

• 4 3 . 

205 

J 1 

3 

NANB 

53 

14 ' 

12 

3 

NANB 

159 

6 

13 

4 

NANB 

-3 

11 

14 

4 

NANB 

55 

132 

1 5 

4 

NANB 

83 

n/A 

16 

4 

NANB 

140 

N/A 

17 

5 

KAV 

0 

■ ' ' ' 4 

16 

5 

HAV 

25 

14 7 

X 9 

5 

HAV 

40 

18 

20 

5 

HAV 

268 

■ 5 

21 

6 

HAV 

-8 • 

N/A 

22 

6 

HAV 

". 15 ■ ■ 

106 

2 3 

6 

HAV 

41 

10 

24 

6 

HAV 

129 

N/A 

26 

7 

HAV 

0 

.7 

27 

7 

HAV 

22 

83 

28 

7 

HAV 

115 

5 

29 

7 

HAV 

139 

N/A 

30 

8 

HAV 

0 

15 

31 

8 

HAV 

26 

no 

32 

8 

HAV 

74 

: - 8 ■ 

33 

8 

HAV 

205 

■ ■ 5 ■ 

34 

9 

HBV 

-290 

N/A 

35 

9 

HBV 

379 

9 

36 

9 

HBV 

: .435 ■ , • 

■ 6 . 

37 

10 

HBV 

0 

8 . 

38 

10 

HBV 

111-118 (pool) 

96-156 (pool) 

39 

10 

HBV 

205 

9 

40 

10 

HBV 

240 

13 

41 

11 

HBV 

0 

11 

42 

11 

HBV 

28-56 (pool) 

8-100 (pool) 

43 

11 

HBV 

169 

9 

44 

11 

HBV 

223 

10 
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FIG. 34 LEGEND 

Patient 
Lane Reference 
Number Number Diagnosis ALT Level (mu/mll 


1 

, 1 
1 

NANB 

1 *l C A 

2 

. i 
l l 

NANB 


3 

- I 

! 

NANB 

Id 

4 


NANB 

79 

5 

\\ 

NANB 

26 

6 

3 1 

MAUD 

NANB 

7ft 

7 

! 

u a r-io ■ ■ 
NAffo 

A 7 

8 


NANB 

25 

9 

«• 

NANB 

60 

10 

A 1 

4 A 

NANB 

1 1 
X J 

1 1 

c 1 

l\ 

NAflD 


1 2 

5 * 

MAUD 

101 

■ 3 

1 3 

el 


4 74 

1 A ' 

6 

O 

MA MR. 

318 

1 c 
1 3 

7 1 

NANA 

20 

1 o 

7 1 
/ 

NANA 

163 

1 ' 

fll 

o 

MAMA 

44 

10 

Pi 

NANA 

SO 

1 o 
I 7 

0 

J 

MAMD 

N/A 


1U 



21 

11 

NANB 

N/.A 

22 

12 

Normal 

M/A 

N/A 


i 1 
I J 

Norma X , 

N/A 



NnrnA 1 

N/A 

26 

30174 

Normal 

N/A 

27 

10 ] OS 

JulvJ 

Nil r*m a 1 

N/A 

28 

30072 

Normal 

N/A 

79 

JVvtO 

Ma rmA 1 

N/A 

10 

J V 

10 1 4 £ 
i TO 

Ns*% vwn m 1 . 
HUfcind JL 

N/A 

1 1 

oni en 

rtuxmci x 

n/a 

32 

30071 

No rma 1 

N/A 

33 

15 

AcuteHAV 

N/A 

34 

16 

AcuteHAV 

N/A 

35 

17 

AcuteHAV 

N/A 

36 

18 

AcuteHAV 

N/A 

3 7 

48088 

AcuteHAV 

N/A 

36 

47288 

AcuteHAV 

N/A 

39 

47050 

AcuteHAV 

N/A 

40 

46997 

AcuteHAV 

N/A 

41 

19 

Convalescent HBV 

N/A 

42 

20 

(anti-HBSag+ve; 

N/A 

4 3 

21 

ainti-HBCag+ve) 

N/A 

44 

22 

(anti-HBSag+ve; 

N/A 

45 

23 

anti-HBCag+ve) 

N/A 

46 

24 

(anti-HBSag+ve; 

N/A 

47 

25 

ariti-HBCag+ve) 

N/A 

48 

26 

(anti-HBSag+ve; 

N/A 

49 

27 

anti-HBSag+ve) 

N/A 


Sequential serum samples were assayed from these patients 
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FIG. 34- 1 


FIG 34-2 
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FIG. 36-I COOH-terminus of SOD-C100 Fusion Polypeptide 

. scjI> _„„ — COOH] [--adaptor— -] [NANBHpolypeptide> 

AlaCysGlyVallleGlylleAlaGlnAsnLeuGlylleArgAspAlaHisPheLeuSer 
1 GCTTGTGGTGTAATTGGG ATCGCCCAGAATTTGGGAATTCG^^ 

CGAACACCACATTAACCCTAGCGGGTCTTAAACCCTOAAGCCCTACGGGTGAAAGATAGG 


>>>>>>>»>>>>>>»»» 
GlnThrLysGlnSerGlyGluAsnLeuPro^ 
61 CAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGGGTAC^ 
GTCWTTTOGTCTCArc^ 

AlaAi^MaGlnAlaProProProSerTrpAspGl 
121 GCTAGGGCTCAAGCCCCTCCCCCAT^ 

CGATCCCGMTTCGGGGAGGGGGTAGCACCCTGGTCTAC^ 

LysProThrLeuHisGlyProThrProLeuLeuT^ 

181 AAGCCCACCCTCCATCGGCC^ 

TTCGGGTGGGAGGTACCOGGTTGTGGGGACGATATGTC 

IleThrl^uThrHisProValThrLysTyrlleMetThrCyaMetSerAlaAspLeuGlu 
241 ATCACCCTGACGC^CCCAGTCACCAAATACATCATGACAT^ 

TAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTAOGTACAG 

ValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeu^ 
301 GTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGT^ 
CAGCAGTGCTCGTGGACCCAGGAGCAACOGCCGCAGGACC^ 

I^uSerThrGlyCysValVallleValGlyArgVa^ 
361 CTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTC 
GACAGTTGTCCGACGCACCAGTATCACCOGTCCCAGCAGA^ 

IleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHis 
421 ATACCTGACAGGGAAGTCCTCTACCGAGAGTTCX^TGAGA 

TATCGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACC1TCTCACGAGAGTCGTO 

LeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnL^ 
481 TTACCGTACATa^GCAAGGGATGATG^ 

AATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCXSTCAAGTTCGTCTTC^ 

LeuteuGlnThrAlaSerArgGlnMaGluVallleAlaProAlaValGlnThrAsnTrp 

541 ctcctgcagaccgcgtcco;tca# 

gaggacgtctggcgcagggcagtcogtctccaatagoggggacgaca 

GlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyr 
601 CAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGT^ 
GTTTTTCAGCTCTGGAAGACCCGCTTCGTATACArc 

LeuAlaGlyl^uSerThrLeuProGlyAsnProAlali 
661 TTGGCX5GGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGC 
AACOGCCCGAACAGTTGCGAOGGACCATTGGGGOGGTAA 

AlaAlaValThrSerProLeuThrThrSerGln^ 
721 GCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTC 
CX^CGAC^TGGTC^TGATTGGTGATCGGTTTGGGA 

TrpValAlaAlaGlnLeuAlaAlaProGlyAlaAldThrAlaPheValGlyAlaGlyLeu 
781 TGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTA 
ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGOGATGAOGGAAACACC 

AlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGly 
841 GCIX3GCGCCGCCATOGGCAGTGTTGGACTGGGGAAGGTC 
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CGACCGCGGCGGTAGCCGTCACAAC&TGACCCCTTCCAGGAGT^^ 

TyrClyAlaGlyvamaGlyAlaLeuValAlaPheLyalleMetSerGlyGluValPro 

901 TAIGGCGCGGGCGTGGCGGGlttSCTCTTC 

ATAC0GCGOX»CAC0GC(XnXX3AGAACACCGTAAGTTCT 

SerThrGluAspI«uValA8nLeul«uProAlaIleLeuSerProGlyAlaLeuValVal 
961 TCCACGGAGGACCTGGTCAATCTACTGCCC6CCATCeTCTC6CCCGGAGCCCTC6TAGTC 
AGGTGCCTCCTGGACCAGTTAGATGACGGGCX^ 

GlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAiaValGln 

1021 ggcgtcgtctgtgcagc^tactgcgccggc^^ 

ccgcac(^gacacgtcgttatgacgoggccgtgq^cx;gggcccgctcccccgtcaq»tc 

«<<«<«<<<<<««<<NANBH] [ — «ctra 
TrpMetAsnArgl^uIleAlaPheAlaSerArgGlyAsnHisValSerProValHisHis 

1081 tggatgaaccggctgatagccttcgottc^ 
acctactnmccgactatoggaagcggagggtc 

— .-] 

Ly sArgOP 

114 1 AAGCGTTGACGCTCCCTACGGGTGGACTGTGGAGAGACAGGGCACTGCTAAGGCCCA^ 
TTCGCAACTGCGAGGGATGCCCACCTGAGACCTCTCTGTC^ 

1201 CTCAGCCATGCATCGAGGGGTACAATCCX3TATGGCCAACAACTAGCGOGTACGT 

GAGTCGGTACGTAGCTCCCCATGTTAGGCATACCGGTT6TT6ATCGCGCATGCAOT 

1261 TCCTTTCTCGATGGTCCATACCTTAGATGCGTTAGCATTAATCOGAATTC 
AGGAAAGAGCTACCAGGTATGGAATCTACGCAATCGTAATTAGGCTTAAG 


FIG. 36-2 
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FIG. 4I-I 

Homology between the HCV polypeptide encoded by combined ORF of clones 


14i through 39c) and the non-structural protein of the Dengue flavi- 


virus(MNWVDl). 


10 20 30 40 50 

HCV EYVVLLFTjLLABARVCSC^ 

MNWVD1 AVSFVTLI1£NMSFRDLGRVMVMVGATMTDI)TC 

130 140 150 160 170 180 

60 70 80 90 100 110 

HCV WYIJCGKWVPGAVYTFYGMWPLLLLU^ 

MNWVD1 TSKEI/4MTTIGrVLLSQSTIPE^ 

190 200 210 220 230 240 

120 130 140 150 160 170 

HCV KR Y ISWCLWWLQYFLTRVEAQLHVWI PPLNVR 

: . . . : . ; * . . . : : . : . : . : 

MNWVD1 NAVIIflNAWKVSCTILAWSVSPIJLTSSM^ 

250 260 270 280 290 

180 190 200 210 220 230 

HCV rePLWILQASLLKVPYF-VKV^ 

MNWVD1 KKRSWPliiai^VGMVSILASS 

300 310 320 330 340 350 

240 250 260 270 280 290 

HCV TPLRDWAHNGLRDLAVAVEPVVFSC^IETKLITWCSADTAACGDI INGLPVS ARRGREILLG 

• • • . > • • • •••••• 2 . • 2 

MNWVD1 ADVK-WEIX)AEISGSSPILSITISE-IX3SMSIKNEEEEQTLTILIRTGLLVI^^^ 
360 370 380 390 400 410 

300 310 320 330 340 350 

HCV PADGMVSKGWRLIAPITAYAQQTRGIJ^CIITSLTGRDKNQVK 

* ••••••• • • 8 • • . • ••>••••• >•■••*. ••• 

MNWVD1 VSIPITAAAWYLWE^KQRAGVLVroVPSPPPVGKAELEDGAYRIKQKGILGYSQIGAGVY 
420 430 440 450 460 470 

360 370 380 390 400 410 

HCV INGVCWTVYHGAGTRTIASPKGPVIQMYTNVDQDLV GWPAPQGSRSLTPCTCGSSD 

MNWVD1 KKTFHTMWHVTRGAVI/1^ 

480 490 500 510 520 530 

420 430 440 450 460 470 

HCV LYLVTRHAWIPVRRRGDSRGSLI^PRPISYLKGS 

• • • ••• ... : . t * . : , , . . : ::• 

MNWVD1 PGKNPRAVQTKPGLFKTN — AGTIGAVSLDFSPGTSGSPIIDKKGKWGLYGNGWTRSG 

540 550 560 570 580 590 

480 490 500 510 520 530 

HCV AKAVDF I PVENLETTMRS PVFTDNSS PPWPQSFQVAHLHAPTGSGKS— TKVP AAYAAQ 

• , 9 w » » 8 • • • • !••!• • ••• * • I • ■ •••••• • • 

MNWVD1 AYVSAIAQTEK — SIEDNPEIEDDIFRK R^TIi!DLHPGAGKTKRYLPArVRGAI KR 

600 610 620 630 640 

540 550 560 570 580 
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HCV G YKVLVLNPS — VAATLGFGAYMSKAHG IDPNIRTGVRTITTGS P ITYSTYGKFLADGGC 

MNWVD1 6l*RttIlJtfTRVVAAiE^ 

650 660 670 680 690 700 

590 600 610 620 630 640 

HCV SGGAYDIIICDKHSTrATSII^IGTVU^^ 

X :: :: : . : . :s:::t* . : : V. 

MNWVD1 RVPNY^ilMDEMFTDPASIAARGYISTRV^ 

710 720 730 740 750 760 

650 660 670 680 690 700 

HCV ALSTTGEIPFYGKAIPLEVIKGGRHLIFCHSKKKCDELAAKLVAI^ 

MNWVD1 IMDEERE I PERSWSSGHEWVTOFKGKTVW^VPS I KAGNDTAACLRKNGKKVTQLSRKTFD 

770 780 790 800 810 820 

710 720 730 740 750 760 

HCV iptSGDVVWATDAIWTGYTGD^ 

MNWVD1 SEYVKTRTNDWNFVVTTO^ 

830 840 850 860 870 880 

770 780 790 800 810 820 

HCV QRRGRTGRGK PG I YRFVAPGERPSGMFDSSVI^ECYDAGCAWYELTPAETTVRLRAYMNT 

MNWVD1 SS 


FIG. 41-2 
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FIG. 43 

DISTRIBUTION OF RANDOM SAMPLES 

C 100-3 Ag ELISA Preclinical Kit 

416ng C100/WELL. 2 HRS 37°C, 20ul SAMPLE 


637 n 


300- 


O 250 4 

99 200 4 
q: 

V} 150 4 
Q 

10041 


50- 


n=1056 


Wvl I I 111 I I I I T 
0 .1 .2 .3 .4 .5 .6 .7 .8 .91.0, 1.5 >3.0 

OD @ 492 nm 
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FIG. 44 

Distribution of O.D« Valu«c for 
Random Blood Donor Saaplcs Tested with Two BLISA 
Configurations 

C 100-3 Ag ELISA MoAB vs Polyclonal 


637 


300 + 

O 250 
D 

eg 200 + 
q: 

£ 150 + 
Q 

100-- 
50-- 
0 


Jj 


JjLjl 


MoAB « 1056 
Poly - 1077 


.2 


.5 


.7 


1-0 


OD @ 492nm 


>3.0 
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FIG. 45 


Name Common Sequence Variable Sequence 


5' -3-1 AAGCTTGATCGAATTC CGAfCTTGC 

-2 CGATCCTGC 

-3 CGATCATGC 

-4 CGATCGTGC 

-5 CGAAGTTGC 

_6 CGAAGCTGC 

-7 AGATCTTGC 

-8 AGATCCTGC 

-9 AGATCATGC 

-10 AGATCGTGC 

-11 AGAAGTTGG 

-12 AGAAGCTGC 

-13 CGATCTTGT 

-14 CGATCCTGT 

-15 CGATCATGT 

-16 CGATCGTGT 

-17 CGAAGTTGT 

-18 CGAAGCTGT 

-19 AGATCTTGT 

-20 AGATCCTGT 

-21 AGATCATGT 

-22 AGATCGTGT 

-23 AGAAGTTGT 

-24 AGAAGCTGT 

-25 CGCTCTTGC 

-26 CGCTCCTGC 

-27 CGCTCATGC 

-28 • ■ ■ CGCTCGTGC 

-29 CGCAGTTGC 
-30 • : CGCAGCTGC 

.;' -31 CGCTCTTGT 

-32 CGCTCCTGT 

-33 CGCTCATGT 

-34 CGCTCGTGT 

-35 CGCAGTTGT 

-36 CGCAGCTGT 
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FIG. 46-1 Translation of DNA k9-l 

GlyCysProGluArgLeuAlaSerCysArgProI^uTh^ 
1 CAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGG 
GTCCGACAGGACTGTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ProIleSerTyrAlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyrPro 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC 
CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGG 

ProLysProCysGlylleValProAlaiysSer^ 
121 CCCOU^CCTTGCGGTATTGTGCCCGCGAAGAGTCTGTGTGGTC^ 

GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGT^ 

ProSerProValValValGlyThxThrAspAr^^ 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGG 
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCC 

GluAsnAspThrAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPhe 
24 1 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

GlyCysThrTrpMetAsnSerThxGlyPheThrLysValCysGlyAlaProProCysVal 
301 TCGGTTGTACCTCGATGAACTCAACTGGATTCACCAAAGlCTG 

AGCCAACATGGACCTACTTGiAGTTGACCTAAGTGGTTTCACACGCCTGGGGGAGGAACAC 

IleGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
361 TCATCGGAGGGGCGGGCAACAACACCCTGCACTGCCCGACTGATTGCTTCCGCAAGCATC 
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGAGTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSerArgCysGlySerGlyProTrpIleThxProArgCysLeuValAsp 
421 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 

TyrProTyrArgLeuTrpHisTyrProCysThrlleAsnlVrthrllePheLysIleArg 
481 ACTACCCGTATAGGCTTIX^CATTATCCTTGTACCATCAACTACACTATATTTAAAATCZA 
TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGATATAAATTTTAGT 


MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThxArgGlyGlu 
541 GGATGTACGTGGGAGGGGTCGAGCACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTAC^TGC^CCCTCCCCAGCTCGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 

ArgCysAspI^uGluAspArgAspArgSerGluI^uSerProI^uI^uLeuThrThrThr 
601 AACGTTGCGATCTGGAAGATAGGGACAGGTCCGAGCTCAGCCCGTTACTCCTGACCACTA 
TTGCAACGCTAGACCTTCTATCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGAT 

GlnTrpGlnValLeuProCysSerPheThrThxLeuProAlaLeuSerThxGlyLeuIle 
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTGCCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGACGGTCGGAACAGGTGGCCGGAGT 

— Overlap with Combined ORF of DHAs 12 f through 15e— — 

HisLeuHisGlnAsnlleVaLAspValGlnTyrLeuTyrGlyValGlySerSerlleAla 
721 TCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCG 
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 

SerTrpAlalleLysTrpGluTyrValYalLeiiLeuPheLeuLeuIjeuAlaAspAlaArg 
781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTCCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACCCTCATGCAGCAGGAGGACAAGGAAGACGAACGTCTGCGCG 
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ValCysSerCysLeuTrpMetMetLeuLeuIleSerGiciAlaGluAlaAlaLeuGluAsn 
841 GCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGGAAGCGGCTTTGGAGA 
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTTCGCCGAAACCTCT 


LeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeuValSerPheLeuVal 
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTCTIGTATCCTTCCTCG 
TGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 


PhePheC^sPheAlaTrpTyrl^uLysGlyLysTrpValProGlyAlaValTyrThrPhie 
961 TGTTCTTCTGCTTTGCATGGTATCTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCT 
ACAAGAAGACGAAACGTACCATAGACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA 


TyrG 1 yMetTrp P roLeuLeuLeuLeuLeuLeuAl aLeuPr oG 1 nAr gAl aTyrAl aLexi 
1021 TCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGC 
AGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG 


AspThrGluValAlaAlaSerCysGlyGlyValValLeuValGlyLeuMetAlaLeuThr 
10 8 1 TGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTAA 
ACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGATT 


LeuSerProTyrTyrLysArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPhelieu 
114 1 CTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTC 
GAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAG 


ThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsnValArgGlyGlyArg 
1201 TGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGC 
ACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCG 


AspAlaVallleLeuLeuMetCysAlaValHisProThrLeuValPheAspIleThrLys 
1261 GCGACG CTGTCATCTTACTCATGTGTGCTG TACACCCGACTCTGGTATTTGACATCACCA 
CGCTGCGACAGTAGAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGT 


LeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAla 
1321 AATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCAG 
TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTC 


FIG. 46-2 
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p|Q 4 7- | COMBINED ORF OF DNAs K9-1 through 15e 

GlyCysProGluArgl^uMaSerCysArgProteuTte^ 
1 CAGGCTGTCCTCAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTC^CMGGCTC 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGG 

ProIleSerTyrMaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyr 
6 1 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC 
CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGG 

ProLysProCysGlyllevalProAlaLysSerValCysGlyProVa^ 
121 CCCCAAMCCTTGCGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTT^ 
GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACACACC^ 

ProSerProValValValGlyThrThrAspArgSe^ 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCT 
G AGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGG 

GluAsnAspThrAspValPheVall^uAsnAsnThrArgPro^ 
241 GTGAAAATG ATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCG 

GlyCysThrTrpMetAsnSerThrGlyPheThr^^ 
301 TCGGTTGTACCTGGATCAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTC 
AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

IleGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
361 TCATCGGAGGGGCGGGCAACAACACCCTCCACTGCCCCACTGATTGCTTCCGCAAGCATC 
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGtTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSerArgCysGlySerGlyProTrpIleThrProArgCysLeuValAsp 
421 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATG AGAGCCACGCCG AGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 

TyrProTyrArgl^uTrpHisTyrProCysThrlleAsnTyrThrllePheLysIleArg 
481 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCA 
TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGT 

MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu 
541 GGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 

ArgCysAspLeuGlxiAspArgAspArgSerGluI^uSerProI^uI^uI^uThrThr^ 
601 AACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA ; 
TTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGGAATGACGACTGGTC 

GlnTrpGlnVall^uProCysSerPheThrThrl^uProAlal^uSerThrGlyl^uIle 
661 CAGAGl^CAGGTCCTCCCGTGTTCCTTCACAACCCTACGAGCCTTGTCCACGGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGT 

HisLeaHisGlnAsnlleValAspValGlnTyrLeuTyrGlyValGlySerSerlleAla 
721 TCCACCTCCACCAiG AACATTGTGG ACGTGC AGTACTTG TACGGGGTGGGG TC AAGCATCG 
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 

SerTrpAlalleLysTrpGluTyrValVall^ 
781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACGCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCG 

ValCysSerCysl^uTrpMetWetLeuLeuIleSerGlnAlaGluAlaAlaLeuGluAsn 
841 GCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGA 
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCT 

I^uValllel^uAsnAlaAlaSerl^uAlaGlyThrHisGlyl^uValSerPheL^Val 
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCG 
TGGAGCATTATOAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 
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' PhePheCysPheAlaTrpTyrl^uLysGlyLy^^ 
9fil TGTTGTTCTGCTTTGCATGGTATTTGAAGGGTAAGTGOTTGCCCGGMOTGT^iJC^C^ 



AO^St^CGGCG^C^^^ 

ThrArgValGluAlaGliiI^uIttsValTrp^ 

1201 ssissssssssss^s^sssss- 

ASDAl»valIleI*uI*uMrtCyaUaV.lHisP^^ 


AsnHisI/suThrProI^uArgAspTrpAlaHisAsnGlyl^uArgAsp^^ 
1S01 ATAAC^TCT^CTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGG 

ValGluProValValPheSerGlnMetGluT^ 
I!Sc§«50CGs5w^ 



1801 


AspLysAsnGlnValGluGlyGlu^ 



1M1 *£^T^^ 

A^GTTGCA^AGWAC^CACGACCTGACAGATGGTGCCC 

AlaSerProLysGlyProVallleGlnMetTyrThrAsnValAspGlnAspLeuValGly 

FIG. 47-2 
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1921 TCGCGTCACCCAAGGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGG 
AGCGCAGTGGGTTCCCAGGACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACC 

TrpProAlaProGlnGlySerArgSerLeuThrProCysThrCysGlySerSerAspLeu 
1981 GCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACC 
CGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGG 

TyrLeuValThrArgHisAlaAspVallleProValArgArgArgGlyAspSerArgGly 
204 1 TTTACCTGGTCACGAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGG 
AAATCGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCC 

Serl^uLeuSerProArgProIleSerTyrLeuLysGlySexSerGlyGlyProLeuLeu 
2101 GCAGCCTGCTGTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGT 
CGTCGGACGACAGCGGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACA 

CysProAlaGlyHisAlaValGlyllePheArgAlaAlaValCysThrArgGlyValAla 
2161 TGTGCCCCGCGGGGCACGCCGTGGGGATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGG 
ACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACC 

LysAlaValAspPhelleProValGluAsnLeuGluThrThxMetArgSerProValPhe 
2221 CTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGT 
GATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACA 

ThrAspAsnSerSerProProValValProGlnSerPheGlnValAlaHisLeuHisAla 
2281 TCACGGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATG 
AGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTAC 

ProThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLys 
2341 CTCCCACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATA 
GAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATAT 

ValLeuValLeuAsnProSerValAlaAlaThrLeuGlyPheGlyAlaTyxMetSerLys 
2401 AGGTGCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCA 
TCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGT 

AlaHisGlylleAspProAsnlleArgThrGlyValArgThrlleThrThrGlySerPro 
2461 AGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCC 
TCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGG 

IleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyr 
2521 CCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTT 
GGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAA 

AspIlellelleCysAspGluCysHisSerThrAspAlaThrSerlleLeuGlylleGly 
2581 ATCACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCG 
TACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGC 

ThrVall^uAspGlnMaGluThrAlaGlyAlaArgLeuValValLeuAlaThrAlaThr 
2641 GCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCA 
CGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGT 

ProProGlySerValThrValProHisProAsnlleGluGluValAlaLeuSerThrThr 
2701 CCCCTCCGGGCTCCGTCACTGTGGCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCA 
GGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGT 

GlyGluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHis 
2761 CCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGAC 
GGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTG 

LeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeu 
2821 ATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCAT 
TAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTA 

GlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVallleProThrSerGly 
2881 TGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCG 
ACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGC 

FIG. 47-3 
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AspValValValValMaThrAspMaLeuMetThrGlyTyrThrGlyAspPheAspSer 
2941 GCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACT 
CGCTACAACAGCAGCACCGTTCGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGA 

VallleAspCysAsnThrCysValThrGlnThrValAspPheSerLeuAspProThrPhe 
3001 CGGTGATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCT 
GCCACTATCTGACGTTATGCACACAGTCGGTCTGTCAGCTAAAGTCGGAACTGGGATGGA 

ThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArg 
3061 TCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCA 
AGTGGTAACTCTGTTAGTGOGAGGGGGTCCTACGACAGAGGGCGrc 

ThrGlyArgGlyLysProGlylleTyrArgP^ 
3121 GGACTCGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCOG 
CCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGC 

MetPheAspSerSerValieuCysGluCysTyrAspMaGiyCysAlaTrpTyrGluLeu 
3181 GCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGC 
CGTACAAC^TGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTC^ 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
3241 TCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCG 
AGTGCGGGCGGCTCTGATGTCAATGCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGC 

CysGlnAspHisI^uGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla • 
3301 TCTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATG 
ACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTAC 

HisPhel^uSerGlnThrLysGlnSerGlyGluAs^ 
3361 CCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACC 
GGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGG 

AlaThrValCysAlaArgAlaGlciAlaProProProSerTrpAspGlnMetTrpLysCys 
3421 AAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGT 
TTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCA 

Leul leArgLeuLysPrbThrLeuHisGlyProThr ProLeuLeuTyrArgLeuGlyAla 
3481 GTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCG 
CAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATAtGTCTGACCCGC 

ValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSer 
3541 CTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCA 

GACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTC 

AlaAspLeuGluValValttirSerThrt^ 
3601 CGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTT 
GCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAA 

AlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArgValValLeuSerGly 

3661 tggccgcgtattgcctgtcaacag<k:tgcgtggtcatagtgggcagggtc^ 

ACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGC 

LysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGlu 
3721 GGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTOGATGAGATGGAAG 
CCTTCGGCCGTTAG TATGGACTGTCCCTTCAGG AGATGGCTCTCAAGCTACTCTACCTTC 

CysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGln 
3781 AGTGCTGTC^GCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGC 
TCACX3AGAGTCGTGAATGGCATGTAGCTCGTTC 

LysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaVal 
3841 AGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTG 
TCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGAC 

GlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSer 
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3901 TCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCA 
AGGTCTGGTIGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGT 

GlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeu 
3961 GTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTCCTTCAT 
CACCCTATGTTATGMCCGCCCGMCAGTTGCGACGGAGCATTGGGGCGGTAACGAAGTA 

MetAlaPheThrAlaMaValThrSerProLeuThrThrSerGlnThrLeiiLeuPheAsn 
4021 TGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCA 
ACTACCGAAAATCTCGACGACAGTGGTCGGGTGATTC^ 

Ilel^uGLyGlyTrpValMaAlaGlnl^uAiaAlaProGlyAlaAlaTte 
4081 ACATATTGGGGGGGTCGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTAC 
TGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGC 

GlyMaGlyl^uMaGlyAlaMalleGlySerValGlyLeuGlyLysValLeuIleAsp 
4141 TGGGCGCTGGCTTAGCTGGCGGCGCCATCGGCAGTCTTOTACTGGGGAAGGTCCTCATAG 
ACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAAGCTGACCCCTTCCAGGAGTATC 

Ilel^uAlaGlyTyrGlyMaGl^alAlaGl^ 
4201 ACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGA 
TGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTOGAGAACACCGTAAGTTCTAGTACT 

GlyGluValProSerThrGluAspI^uValAsnLeuLeuProAlalleLeuSerProGly 
4261 GCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCG 
CGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGC 

AlaLeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGlu 
4321 GAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCG 
CTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGC 

GlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
4381 AGGGGGCAGTGCAGTGGATGAAGCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTT 
TCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGIGAGG^CCCCCTTGGTACAAA 

ProThrHi sTyrVal ProGluSer AspAlaAlaAlaArgValThrAlalleLeuSerSer 
4441 CCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCA 
GGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGT 

LeuThrValThrGlnLeuLeiiArgArgLeuHisGlnTrpIleSerSerGluCysThrThr 
4501 GCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCA 
CGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGt 

ProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCysGluValLeuSerAsp 
4561 CTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCG 
GAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGC 

PheLysThrTrpLeULysAlaLysLeuMetProGlnLeuProGlylleProPheValSer 
4621 ACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATC 

TGAAATTCTGGACCGATTTTCG ATTCGAGTACGGTCTCGACGGACCCtAGGGGAAAGACA 

CysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMetHisThrArgCysHis 
4681 CCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCC 
GGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGG 

CysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArglleValGlyProArg 
4741 ACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTGGGTCCTA 
TGACACCTCGACTCTAGTGACCTGXACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGAT 

ThrCysArgAsnMetTrpSerGlyThrPheProIleAsi^ 
4801 GGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCT 
CCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGA 

ThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyr 
4861 GTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAAT 
CATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTA 
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ValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMetThrThrAspAsnLeu 
4921 ATGTGG AGATAAGG CAGG TGGGGGACTTCCACTACG TG ACGGG TATGACTACTG ACAATC 
TACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAG 

LysCysProCysGlnValProSerProGluPhePheThrGluLeuAspGl^ValArgLeu 
4981 TCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCC 
AGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGOGG 

HisArgPheAlaProProCysLysProLeuLeuArgGluGluValSerPheAr^ValGly 
5041 TACATAGGTTTGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAG 
ATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATC 

I^uHisGluTyrProValGlySerGlnLeuPro^^ 
5101 GACTCCACGAATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACOGGACGTGGCCG 
CTGAtteTGCTTATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGC 

LeuThrSerMetLeuThrAspProSerHislleThrAlaGluAlaAlaGlyArgArgLeu 
5161 TGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGT 
ACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCA 

AlaArgGlySerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSer 
5221 TGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCAT 
ACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTA 

LeuLysMaThrCysThrAlaAsnHisAspSerProAspAlaGluLeuIleGluAlaAsn 
5281 CTCTCAAGGCAACTTGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCA 
GAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCOGGT 

LeuI^uTrpArgGlnGliiMetGlyGlyAsnlleThrArgValGluSerGluAsnLysVal 
5341 ACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAG 
TGGAGGATACCTCCGTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTC 

Valllel^iiAspSerPheAspProI^uValMaGluGluAspGliiArgGluIleSerVal 
5401 TGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGOGGGAGATCTCOG 
ACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGC 

ProAlaGluIleLeuArgLysSerArgArgPheAlaGlnAlaLeiiProValTrpAlaArg 
5461 TACCCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGC 
ATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCG 

ProAspTyrAsnProProLeuValGluThrTrpLysLysProAspTyrGluProProVal 
5521 GGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTG 
CCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGAC 

ValHisGlyCysProLeuProProProLysSerProProValProProProArgLysLys 
5581 TGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGA 
ACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCT 

ArgThrValValLeuThrGluSerThrLeuSerThrAlaLeuAlaGluLeuAlaThrArg 
5641 AGCGGACGGTGGTCCTCACTGAATCAACGCTATCTACTGCCTTGGCCGAGCTCGCCAOCA 
TCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGT 

SerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThrThrthrSerSerGlu 
5701 GAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTG 
CTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGAC 

ProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSerMetProPro 
5761 AGCCCGCCCCCTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCC 
TCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGG 

LeuGluGlyGluProGlyAspProAspI^iiSerAspGlySerTrpSerThrValSerSer 
5821 CCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTA 
GGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCAT 

GluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThrGlyAlaLeu 
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5881 GTGAGGCCAACGCGGAGGAXGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGqAC 
CACTCCGGTTGCGCCTCCTAC AGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTG 

ValThrPrc<^sMaAlaGluGluGlnLyslAuProIleAsnAlaI^uSerAsnS«Leu 
5941 TCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCW^T^GT 
AGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTT6AGCA 

LeuArgHisHisAsnl^uValTyrSer^hrThrSerArgSerAlaCysGlnArgGlnLys 
6001 TGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAMGGCAGA 
ACGATGCAGTGGTGTTAAACGACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCT 

LysValThrPheAspArgl^uGlnValXeuAspSCT^ 
6061 AGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTAC» 

TeTTTCAGTGIAAACTGTCTGAdGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCC 

ValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGluAlaCysSer 
6121 AGGTTAAMCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAG*^ 

TCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCCTTCGAACGT 

i^uThrProProHisSerAlaLysSerLysPheGly^rGlyAlaLysAspValArg^s 
6181 GCCTGACGCCCCCACACTCAGCCAAATCCAAGTTT^TTATGGGGCAAAM 

CGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTCTGCAGGCAA 

HiaAlaArgl-ysAlaValThrHlsIleAsnSerValTrpLysAspLeuLeuGluAspAsn 
6241 GCCATGCCJVGAAAGGCCGTAACCCA(ATCAACTCCGTGTGGAAAGACCTTCTGGAAGACA 
CGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGT 

ValThrProIleAspThrThrlleMetAlaLysAsnGluValPheCysValGlnProGlu 
6301 ATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTG 
TACATTGTGGTTATCTGTGATGGTAGTAGCGATTGTTGCTCCAAAAGACGCAAGTCGGAC 

LysGlyGlyArgLysProAlaArgI«uIleValPheProAspI«uGlyValArgV^Cys 
6361 agaaggggggtcgtaagccagctcgtctcatcgtgttccccgatctgggcgtgcgcgtgt 
tcttccccccagcattcggtcgagcagagtagcacaaggggctagacccgcacgcgcaca 

GluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAlaValMetGlySerSer 
6421 GCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCT 
CGCTTTTCTACOGAAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGA 

TyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAlaTrpLysSer; 
6481 CCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGT 
GGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCA 

LysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThrValThrGlu 
6541 CCAAGAAAACCCCAATGGG€TTCTCGTA1GATACCCGCTGCTTTGACTCCACAGTCACTG 
GGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGTGAC 

SerAspIleArgThrGluGluAlalleTyrGlnCysCysAspLeuAspProGlnAlaArg 
6601 AGAGCGACATCCGTACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCC 
TCTCGCTCTAGGCATGCCTCCTCCGraAGATGGTTAGAACACTGGAGCTGGGGGTTCGGG 

ValAlalleLysSerLeuThrGluArgLeuTyrValGlyGlyProteuThrAsnSerArg 
6661 GCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAA 
CGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTT 

GlyGluAsnCysGlyTyrArgArgCysArgMaSerGlyVall^uThrThrSerCysGly 
6721 GGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTG 
CCCCCCICT1GACGCCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACAC 

AsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAlaAlaGlyLeuGlnAsp 
6781 GTAACACCCTCACTTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGC^GGGCTCCAGG 
CATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCC 

CysThi^tl^uValCysGlyAspAspLeuValVallleCysGluS^AlaGlyValGln 

6841 ACTG^CCATGCTCGTGTCTGGCGACGAC^^ 

TGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGG 
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GluAspAlaAlaSerLeuArgMaPheTh^ 
6901 AGGAGGACGCGGGGAGCCTGAGAGCCTTCACGGAGGCTATGACCA6GTACTCC6CCCCCC 
ICCTCCTGCGCCGCTO^CTCT^^ 

GlyAspProProGlnProGluTyrAspteuGli^ 
6961 CTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAAra 
GACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTA^ 

SerValMaHisAspGlyAlaGlyLysArgValTyrtyrLeuThrArgAspP 
7021 TGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTC1AC11ACCTCACCGG 
ACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGMGG^ 

ProI^uAlaArgAlaMaTrpGluThr^ 
7081 CCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAG 
GGGGGGAGCGCTCTCGAOGCACCCTCTGTCGTTCTC^ 

AsnllelleMetPheAlaProThrLeuTrpMaArgMetlleLeuMetThrHisPhePhe 
7141 GCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACC 

CGTTGTATTAGTAGAAACGGGGGTGTGACACCCGCTGCTACTATGACTACTGGGTAAAGA 

SerValLeuIleAlaArgAspGlnLeuGlu^^ 
7201 TTAGCGTCCTTATAGCCAGGGACCAGCTTGAACAGGCCCTOGATTGCGAGATCTACGGGG 
AATCGCAGGAATATCGGTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATCCCCC 

CysTyrSerlleGluPrOLexiAspLeuProProIlelleGlnArgljeu 
7261 CCTGCTACTCCATAGAACCACTTGATCTACCTCCAATCATTCAAAGACTC 
GGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
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